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(57) Abstract 

Viral proteins derived from an enterically transmitted non-A/ 
non-B viral hepatitis agent are disclosed. In one embodiment, the pro- 
tein is immunologically reactive with antibodies present in individuals 
infected with the viral hepatitis agent This protein is useful in a diag- 
nostic method for detecting infection by the enterically transmitted 
agent Also disclosed are DNA probes derived from a cloned se- 
quence of the viral agent. These probes are useful for identifying and 
sequencing the entire viral agent and for assaying the presence of the 
viral agent in an infected sample, using probe-specific amplification 
of virus-derived DNA fragments. 
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ENTERICALLY TRANSMITTED NON-A/NON-B 
5 HEPATITIS VIRAL AGENT 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of 
10 U.S. Application Serial No. 336,672, filed April 11, 
1989, which is a continuation-in-part of U.S. 
Application Serial No. 208,997, filed June 17, 1988, 
both of which are herein incorporated by reference. 

15 INTRODUCTION 

Field of Invention 

This invention relates to recombinant 
proteins, genes, and gene probes and more specifically 
20 to such proteins and probes derived from an enterically 
transmitted nonA/nonB hepatitis viral agent, and to 
diagnostic methods and vaccine applications which 
employ the proteins and probes. 

25 Background 

Enterically transmitted non-A/non-B (ET-NANB) 
hepatitis viral agent is the reported cause of 
hepatitis in several epidemics and sporadic cases in 
Asia, Africa, and the Indian subcontinent. Infection 

30 is usually by water contaminated with feces, although 
the virus may also spread by close physical contact. 
The virus does not seem to cause chronic infection. 
The viral etiology in ET-NANB has been demonstrated by 
infection of volunteers with pooled fecal isolates; 

35 Immune electron microscopy ( IEM) studies have shown 

virus particles with 27-34 nm diameters in stools from 
infected individuals. The virus particles reacted with 
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antibodies in serum from infected individuals from 
geographically distinct regions, suggesting that a 
single viral agent or class is responsible for the 
majority of ET-NANB hepatitis seen worldwide. No 
antibody reaction was seen in serum from individuals 
infected with blood-transmitted NANB virus, indicating 
a different specificity between the two NANB types. 

In addition to serological differences, the 
two types of NANB infection show distinct clinical 
dif-f erences . ET-NANB is characteristically an acute 
infection, often associated with fever and arthralgia, 
and with portal inflammation and associated bile stasis 
in liver biopsy specimens (Arankalle). Symptoms are 
usually resolved within six weeks. Blood- transmitted 
15 NANB, by contrast, produces a chronic infection in 
about 50% of the cases. Fever and arthralgia are 
rarely seen, and inflammation has a predominantly 
parenchymal distribution (Khuroo, 1980). The two viral 
agents can also be distinguished on the basis of 
primate host susceptibility. ET-NANB, but not the 
blood-transmitted agent, can be transmitted to 
cynomolgus monkeys. The blood-transmitted agent is 
more readily transmitted to chimpanzees than is ET-NANB 
(Bradley, 1987). 

There have been major efforts worldwide to 

identify and clone viral genomic sequences associated 

with ET-NANB hepatitis. One goal of this effort, 

requiring virus-specific genomic sequences, is to 

identify and characterize the nature of the virus and 

its protein products. Another goal is to produce 

recombinant viral proteins which can be used in 

antibody-based diagnostic procedures and for a 

vaccine. Despite these efforts, viral sequences 

associated with ET-NANB hepatitis have not been 

successfully identified or cloned heretofore, nor have 

any virus-specific proteins been identified or 
produced. 



20 



25 



30 



35 
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SUMMARY OF THE INVENTION 

25 Novel compositions, as well as methods of 

preparation and use of the compositions are provided, 
where the compositions comprise viral proteins and 
fragments thereof derived from the viral agent for ET- 
NANB. Methods for preparation of ET-NANB viral 

30 proteins include isolating ET-NANB genomic sequences 
which are then cloned and expressed in a host cell. 
The resultant recombinant viral proteins find use as 
diagnostic agents and as vaccines. The genomic 
sequences and fragments thereof find use in preparing 

35 ET-NANB viral proteins and as probes for virus 
detection. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows vector constructions and 
manipulations used in obtaining and sequencing cloned 
ET-NANB fragment; and 

5 Figures 2A-2B are representations of Southern 

blots in which a radiolabeled ET-NANB probe was 
hybridized with amplified cDNA fragments prepared from 
RNA isolated from infected (I) and non-infected (N) 
bile sources (2A), and from infected (I) and non- 
10 infected (N) stool-sample sources (2B). 

DESCRIPTION OF SPECIFIC EMBODIMENTS 
Novel compositions comprising generic 
sequences and fragments thereof derived from the viral 
15 agent f or % ET-NANB are provided, together with 

recombinant viral proteins produced using the genomic 
sequences and methods of using these compositions. 

The genome of the ET-NANB viral agent is 
identified as containing a region which is homologous 
to the 1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KF1 <ET1.1) carried in E. coli strain BB4 and having 
ATCC deposit no. 67717. Initial studies sequenced the 
two terminal regions of the insert and an intermediate 
region. The 5 '-end region of this insert contains the 
25 sequence: 

1 42 
GAT GGA AGG CAC TAA TCT GGC AAG ACC TGT CCC TGT TGC AGC 

84 

TGT TCT ACC ACC CTG CCC CGA GCT CGA ACA GGG CCT TCT CTA 
85 

126 

CCT GCC CCA GGA GCT CAC ACA CCC TGT GAT AGT GTC GTA ACA 

35 

• 127 

168 

TTT GAA TTA ACA GAC ATT GTG CAC TGC CGC ATG GCC GCC CCG 



20 
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169 210 
AGC CAG CGC AAG GCC GTG CTG TCC ACA CTC GTG GGC CGC TAC 

5 211 
GGC. 

An intermediate region has the sequence: 

10 691 731 
CTA GAG TGT GCT ATT ATG GAG GAG TGT GGG ATG CCG CAG TGG 

733 774 
CTC ATC CGC CTG TAT CAC CTT ATA AGG TCT GCG TGG ATC TTG 

15 

775 816 
CAG GCC CCG AAG GAG TCT CTG CGA GGG TTT TGG AAG AAA CAC 

817 8S8 
20 TCC GGT GAG CCC GGC ACT CTT CTA TGG AAT ACT GTC TGG AAT 

859 900 
ATG GCC GTT ATT ACC CAC TGT TAT GAC TTC CGC GAT TTT CAG 

25 901 942 
GTG GCT GCC TTT AAA GGT GAT GAT TCG ATA GTG CTT TGC AGT 

943 984 
GAG TAT CGT CAG AGT CCA GGA GCT GCT GTC CTG ATC GCC GGC 

30 

985 1026 
TGT GGC TTG AAG TTG AAG GTA GAT TTC CGC CCG ATC GGT TTG 

1027 
35 TAT. 
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The 3* -end region contains the sequence: 

1191 1232 
TGA GTA GAG GAT GTT GTT TCC CGT GTT TAT GGG GTT TCC CCT 

1233 

JJ 1274 
GGA CTC GTT CAT AAC CTG ATT GGC ATG CTA CAG GCT GTT GCT 

1275 1316 
GAT GGC AAG GCA CAT TTC ACT GAG TCA GTA AAA CCA GTG CTC 

1317 1327 
GAC CGG AAT TC. 

Additional work has provided the entire sequence, in 
both directions, as set forth below. The sequence of 
both strands is provided, since it is not known in 
which strand the gene is located. However, the 
sequence in one direction has been designated as the 
"forward" sequence because of statistical similarities 
to known proteins. This sequence is set forth below 
along with the three possible translation sequences. 
There is one long open reading frame that starts at 
nucleotide 145 with an isoleucine and extends to the 
end of the sequence. The two other reading frames have 
many termination codons. Standard abbreviations for 
nucleotides and- amino acids are used here and elsewhere 
in this specification. 

Forward Sequence 

RIP PTDGRHZSGKTCPCCSC 

EPRQLMEGTNLARPVPVAAV 

NSA NZWKALIWQDLSLLQ LF 
* * 

1 11 21 31 41 51 

CGAATTCCGCCAACTGATGGAAGGCACTAATCTGGCAAGACCTGTCCCTGTTGCAGCTGT 
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STTLPRARTGPSLPAPG AH H 
LPPCPELEQGLLYLPQELTT 

YHPAPSSNRAPSTCPRSSPP 
****** 

61 71 81 91 101 Hi 

TCTACCACCCTGCCCCGAGCTCGAACAGGGCCTTCTCTACCTGCCCCAGGAGCTCACCAC 

L Z ZCRNIZINRH CALPHGRP 
CDSVVTPELTDIVHCRMAAP 

VIVSZHLNZQTLCTAAWPPR 
****** 

121 131 141 151 161 171 

CTGTGATAGTGTCGTAACATTTGAATTAACAGACATTGTGCACTGCCGCATGGCCGCCCC 

EPAQGRAVHTRGPLRRRTKL 
SQRKAVLSTLVGRYG7AQSS 

ASARPCCPHSWAATASHKAL 

* * * * * * 

181 191 201 211 221 231 

GAGCCAGCGCAAGGCCGTGCTGTCCACACTCGTGGGCCGCTACGGCGTCGCACAAAGCTC 

YNASHSDVRDSLARFIPAIG 
TMLPTLMFATLSPV LSRPLA 

QCPPLZCSRLSRPFYPGHWP 

* * * * * * 

241 251 261 271 281 291 

TACAATGCTTCCCACTCTGATGTTCGCGACTCTCTCGCCCGTTTTATCCCGGCCATTGGC 

PVQVTTCELYELVEAMVEKG 

PYRLQLVNCTSZWRPWSRRA 

RTGYNLZIVRASGGHGREGP 
****** 

301 311 321 331 341 351 

CCCGTACAGGTTACAACTTGTGAATTGTACGAGCTAGTGGAGGCCATGGTCGAGAAGGGC 
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QDGSAVLELDLCNRDVSRIT 
RMAPPSLSLIPATVTCPGSP 

GWLRRPZAZSLQPZRVQ DHL 

* ***** 

361 371 381 391 401 411 

CAGGATGGCTCCGCCGTCCTTGAGCTTGATCTTTGCAACCGTGACGTGTCCAGGATCACC 

FFQKDCNKPrTGETIAHGKV 

SSRKIVTSSPQVRPLPMVKW 

LPERL ZQVHHRZDHCPWZSG 
****** 

421 431 441 451 461 471 

TTCTTCCAGAAAGATTGTAACAAGTTCACCACAGGTGAGACCATTGCCCATGGTAAAGTG 

GQGIS A WSKTPCALFGPWPR 
ARASRPGARPSAPSL ALGSA 

PGHLGLEQDLLRPLWPLVPR 

* * * * * * 

481 491 501 511 521 531 

GGCCAGGGCATCTCGGCCTGGAGCAAGACCTTCTGCGCCCTCTTTGGCCCITGGTTCCGC 

AIEKAILALLPQGVPyGDAP 
LLRRL PWPCSLR VCPTVMP L 

YZEGY SGPAPSGCVLRZCLZ 

* * * * * * 

541 551 561 571 581 591 

GCTATTGAGAAGGCTATTCTGGCCCTGCTCCCTCAGGGTGTGTTTTACGGTGATGCCTTT 

DDTVFS AAVAAAKASMVPEN 
MTPSSRRLWPQQRHPWCLRM 

ZHRLLGGCGRSKGIHGVZEZ 

* * * * * * 

601 611 621 631 641 651 

GATGACACCGTCTTCTCGGCGGCTGTGGCCGCAGCAAAGGCATCCATGGTGTTTGAGAAT 
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DFS.EFDSTQNNFSLGLECA I 
TFLSLTPPRITFLWVZSVLL 

LFZVZLHPEZLFSGS-RVCYY 
****** 

661 671 681 691 701 711 

GACTTTTCTGAGTTTGACTCCACCCAGAATAACTTTTCTCTGGGTCTAGAGTGTGCTATT 

MEECGMPQWLIRLYHL IRSA 
WRSVGCRSGSSACITLZGLR 

GGVWDAAVAHPPVSPYKVCV 
****** 

721 731 741 751 761 771 

ATGGAGGAGTGTGGGATGCCGCAGTGGCTCATCCGCCTGTATCACCTTATAAGGTCTGCG 

WILQAPKESLR GFWKKHS GE 

GSCRPRRSLCEGFGRN TPVS 

D LAGPEG7SARVLEETLRZA 
* * * * * * 

781 791 801 811 821 831 

TGGATCTTGCAGGCCCCGAAGGAGTCTCTGCGAGGGTTTTGGAAGAAACACTCCGGTGAG 

PGTLLWNTVWNMAVITHCYD 

PALFYGILSGIWPLLPTVMT 

RHSSMEYCL EYGRYYPLLZL 
****** 

841 851 861 871 881 891 

CCCGGCACTCTTCTATGGAATACTGTCTGGAATATGGCCGTTATTACCCACTGTTATGAC 

FRDFQVAAFKGDDSIVLCSE 

SAIFRWLPLKVMIRZCFAVS 

PRFSGGCLZRZZFD SALQZV 
****** 

901 911 921 931 941 951 

TTCCGCGATTTTCAGGTGGCTGCCTTTAAAGGTGATGATTCGATAGTGCTTTGCAGTGAG 
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SRQSPGAAVLIAGCGL.K'LKV 
IVRVQELLS ZSPAVAZ S Z R Z 

SSESRSCCPDRRLWLE7BGR 

* * * * * * 

961 971 981 991 1001 1011 

TATCGTCAGAGTCCAGGAGCTGCTGTCCTGATCGCCGGCTGTGGCTTGAAGTTGAAGGTA 

DFRP IGLYAGVVVAPGLGAL 
IS ARSVCMQVLWWPPAL ARS 

FPPDRF VCRCCGGPRPWRAP 

* * * * * 

1021 1031 1041 1051 1061 1071 

GATTTCCGCCCGATCGGTTTGTATGCAGGTGTTGTGGTGGCCCCCGGCCTTGGCGCGCTC 

PDVVRPAGRLTE .K.N WGPGPE 
LMLCASPAG.LPRRlGALAtS 

ZCCALRRPAYREELGP WP ZA 

* * * * * * 

1081 1091 1101 1111 1121 1131 

CCTGATGTTGTGCGCTTCGCCGGCCGGCTTACCGAGAAGAATTGGGGCCCTGGCCCTGAG 

RAEQLRLA VSDPLRKLTNVA 

GRSSSASLLVISSASSRMZL 

GGAAPPRC ZZPP PQAHEC SS 
. * * * * 

1141 1151 1161 U71 iiBi u 91 

CGGGGGGAGCAGCTCCGCCTCGCTGTTAGTGATTTCCTCCGCAAGCTCACGAATGTAGCT 

QMCVDVVSRVYGVSPGLVHN 
RCVWHLPPVPMGPPLDSPIT 

DVCG CCPPCLWG PPMTRSZP 
* * * * * 

1201 1211 1221 1231 1241 1251 

CAGATGTGTCTGGATGTTGTTTCCCGTGTTTATGGGGTTTCCCCTGGACTCGTTCATAAC 
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LIGML QAVADG KAHFTESVK 
ZLACYRLLLM ARHISLSQZN 

DWHATGCCZWQGTPHZV SKT 

* * * * * - * 

1261 1271 1281 1291 1301 1311 

CTGATTGGCATGCTACAGGCTGTTGCTGATGGCAAGGCACATTTCACTGAGTCAGTAAAA 

P VLDRNSS 
Q C S T G I R 

S A R F E F E 

* * * 

1321 1331 1341 

CCAGTGCTCGACCGGAATTCGAGC 

The complimentary strand, referred to here as 
the "reverse sequence," is set forth below in the same 
manner as the forward sequence set forth above. 
Several open reading frames, shorter than the long open 
reading frame found in the forward sequence, can be 
seen in this reverse sequence. 

Reverse Sequence 

ARIPVEHWFYZLSE MCLAIS 

LEFRSSTGFTDSVKCALPSA 

SNSGRALVLLTQZNVPCHQQ 
****** 

1 11 21 31 41 51 

GCTCGAATTCCGGTCGAGCACTGGTTTTACTGACTCAGTGAAATGTGCCTTGCCATCAGC 

NSLZHANQVMNESRGNPINT 

TACSMPIRLZTSPGETPZTR 

QPVACQSGYERVQGKPHKHG 
* * * * * * 

61 71 81 91 101 111 

AACAGCCTGTAGCATGCCAATCAGGTTATGAACGAGTCCAGGGGAAACCCCATAAACACG 



WO 89/12462 



PCT/US89/02648 



12 

GNNIHTHLS YIRELAEEITN 

ETTSTHIZA TPV SL RRKSLT 

KQHPHTSELHSZACGGNHZQ 

* ***** 

121 131 141 151 161 i 71 

GGAAACAACATCCACACACATCTGAGCTACATTCGTGAGCTTGCGGAGGAAATCACTAAC 

SE AELLRPLRARAPILLGKP 
ARRSCSARSGPGPQPFSVSR 

R GGAA PPAQGQGPNSSRZAG 

* * * * * * 

181 "I 201 211 221 231 

AGCGAGGCGGAGCTGCTCCGCCCGCTCAGGGCCAGGGCCCCAATTCTTCTCGGTAAGCCG 

AGEAH NIRERAKAGGHHNTC 
PAK.RTTSGSAPRPGATTTPA 

RRSAQHQGARQGRGPPQHLH 

* * * * * * 

241 2 51 261 271 281 291 

GCCGGCGAAGCGCAC^CATCAGGGAGCGCGCCAAGGCCGGGGGCCACCACAACACCTGC 

IQTDRAEIYLQLQATAGDQD 

XKP IGRKSTPNPKPQPAIRT 
TNRSGGNL P S T S S H S R R S G Q 

* * * * * * 
301 311 321 331 341 351 

ATACAAACCGATCGGGCGGAAATCTACCTTCAACTTCAAGCCACAGCCGGCGATCAGGAC 

SSSWTLTILTAKHYRIITPK 
A A P G L Z R.Y SLQSTIESSPLK 

QLLDSDDTHCKALSNHHLZR 

* ***** 

361 371 381 391 401 411 

AGCAGCTCCTGGACTCTGACGATACTCACTGCAAAGCACTATCGAATCATCACCTTTAAA 
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GSHLKIAEVITVGNNGHIPD 

AATZKSRKSZQWVITAIFQT 

QPPENRGSHNSGZZRPY SRQ 
* * * * * * 

421 431 441 451 461 471 

GGCAGCCACCTGAAAATCGCGGAAGTCATAACAGTGGGTAATAACGGCCATATTCCAGAC 

SIPZKSAGLTGVFLPKPSQR 

VPHRHVPGSPECPPQNPRRD 

YSIEECRAHRSVSSKTLA ET 
****** 

481 491 501 511 521 531 

AGTATTCCATAGAAGAGTGCCGGGCTCACCGGAGTGTTTCTTCCAAAACCCTCGCAGAGA 

LLRGLQDPRRPYKVIQADEP 
SPGACKIHADLIRZYRRMSH 

PSGPARSTQTLZGDTGGZAT 
* * * * * * 

541 551 561 571 581 591 

CTCCTTCGGGGCCTGCAAGATCCACGCAGACCTTATAAGGTGATACAGGCGGATGAGCCA 

LRHPTLLHNSTLZTQRKVIL 

CG IPHSSIIAHSRPREKLPW 

AASHTPPZZHTLDP EKSYSG 
****** 

601 611 621 631 641 651 

CTGCGGCATCCCACACTCCTCCATAATAGCACACTCTAGACCCAGAGAAAAGTTATTCTG 

GGVKLRKVILKHH. GCL CCG H 

VESNS EKSPSNTMDAPAAAT 

WSQTQKSHSQTPWMPLLRPQ 
****** 

661 671 681 691 701 711 

GGTGGAGTCAAACTCAGAAAAGTCATTCTCAAACACCATGGATGCCTTTGCTGCGGCCAC 
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SRREDGVIKGI TVKHTLREQ 
AAEKTVSSKASPZ NTPZGSR 

PPRRRCHQRHHRKTHPEGA G 
****** 

721 731 741 751 761 7 71 

AGCCGCCGAGAAGACGGTGTCATCAAAGGCATCACCGTAAAACACACCCTGAGGGAGCAG 

GQNSLLNSAEPRA KEGAEGL 

ARIAPSIARNQGPKRAQKVL 

PEZPS QZRGTKGQ RGRRRSC 
.****** 

781 791 801 811 821 831 

GGCCAGAATAGCCTTCTCAATAGCGCGGAACCAAGGGCCAAAGAGGGCGCAGAAGGTCTT 

APGRDALAHPTMGNGLTCGE 

LQ AEMPWPTLPWAMVSPVVN 
SRPRCPGPtlTHGQW S H L W Z T 
* * * * * * 

841 851 861 871 881 891 

GCTCCAGGCCGAGATGCCCTGGCCCACTTTACCATGGGCAATGGTCTCACCTGTGGTGAA 

LVTIPLEEGDPGHVTVAKIK 

LLQSFWK KVILDTSRL QRSS 

CZNI,SGRRZSWTRHGCKDQA 

* * 

901 911 921 931 941 951 

CTTGTTACAATCTTTCTGGAAGAAGGTGATCCTGGACACGTCACGGTTGCAAAGATCAAG 

LKDGGAILALLDHGLH Z LVQ 

SRTAEP S WPPSTHASTSS YN 
QGRRSHPGPSRPWPPLARTI 

961 971 "I 991 1001 1011 

CTCAAGGACGGCGGAGCCATCCTGGCCCTTCTCGACCATGGCCTCCACTAGCTCGTACAA 
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FTSCNL5fGANGRDKTGERVA 
SQVVTCTGPMAG IKRARESR 

HKLZ PVRGQWPGZNGRESRE 

* * * * * * 

1021 1031 1041 1051 1061 1071 

TTCACAAGTTGTAACCTGTACGGGGCCAATGGCCGGGATAAAACGGGCGAGAGAGTCGCG 

NIRVGSIVELCATPZRPTSV 
TSEWEALZ SPVRRRSGPRVW 

HQ SG KHCRALCDAVAAHECG 

* * * * * * 

1081 1091 1101 1111 1121 1131 

AACATCAGAGTGGGAAGCATTGTAGAGCTTTGTGCGACGCCGTAGCGGCCCACGAGTGTG 

DSTALRWLGAAMRQCTMSVN 
TARPCAGSGRPCGSAQCLL I 

QH GLALARGGHAAVHNVCZ P 

* * * * * * 

1141 1151 1161 1171 1181 1191 

GACAGCACGGCCTTGCGCTGGCTCGGGGCGGCCATGCGGCAGTGCACAATGTCTGTTAAT 

SNVTTLSQVVSSWGRZRRPC 
QMLRHYHRWZAPGAGREGPV 

KCYDTITGGELLGQ VEKALF 

* * * * * * 

1201 1211 1221 1231 1241 1251 

TCAAATGTTACGACACTATCACAGGTGGTGAGCTCCTGGGGCAGGTAGAGAAGGCCCTGT 

SSSGQGGRTAATGTGLARLV 

RARGRVVEQLQQGQVLPDZ C 

ELGAGWZNSCNRDRSCQISA 
****** 

1261 1271 1281 1291 1301 1311 

TCGAGCTCGGGGCAGGGTGGTAGAACAGCTGCAACAGGGACAGGTCTTGCCAGATTAGTG 
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PSISWRNS 
L P S V G G I 

P H Q L A E P 
* * * 

1321 1331 1341 

CCTTCCATCAGTTGGCGGAATTCG 



Identity of this sequence with sequences in 
etiologic agents has been confirmed by locating a 
corresponding sequence in a viral strain isolated in 
Burma. The Burmese isolate contains the following 
sequence of nucleotides. 

Non-A Non-B ETi Burmese strain 

1 CGGTTGTTCA GTACCAGTTT ACTGCAGGTG TGCCTGGATC CGGCAAGTCC 
51 CGCTCTATCA CCCAAGCCGA TGTGGACGTT GTCGTGGTCC CGACGCGTGA 
101 GTTGCGTAAT GCCTGGCGCC GTCGCGGCTT TGCTGCTTTT ACCCCGCATA 
151 CTGCCGCCAG AGTCACCCAG GGGCGCCGGG TTGTCATTGA TGAGGCTCCA 
201 TCCCTCCCCC CTCACCTGCT GCTGCTCCAC ATGCAGCGGG CCGCCACCGT 
251 CCACCTTCTT GGCGCCCCGA ACCAGATCCC AGCCATCGAC TTTGAGCACG 
301 CTGGGCTCGT CCCCGCCATC AGGCCCGACT TAGCCCCACC TCCTGGTGGC 
351 ATGTTACCCA TCGCTGCCCT GCGGATGTAT GCGAGCTCAT CCGTGGTGCA 
401 TACCCCATGA TCCAGACCAC TAGCCGGGTT CTCCGTTCGT TGTTCTGGGG 
451 TGAGCCTGCC GTCGGGCAGA AACTAGTGTT .CACCCAGGCG GCCAAGGCCG 
501 CCAACCCCGG CTCAGTGACG GTCCACGAGG CGCAGGGCGC TACCTACACG 
551 GAGACCACTA TTATTGCCAC AGCAGATGCC CGGGGCCTTA TTCAGTCGTC 
601 TCGGGCTCAT GCCATTGTTG CTCTGACGCG CCACACTGAG AAGTGCGTCA 
651 TCATTGACGC ACCAGGCCTG CTTCGCGAGG TGGGCATCTC CGATGCAATC 
701 GTTAATAACT TTTTCCTCGC TGGTGGCGAA ATTGGTCACC AGCGCCCATC 
751 AGTTATTCCC CGTGGC»ACC CTGACGCCAA TGTTGACACC CTGGCTGCCT 
801 TCCCGCCGTC TTGCCAGATT AGTGCCTTCC ATCAGTTGGC TGAGGAGCTT 
851 GGCCACAGAC CTGTCCCTGT TGCAGCTGTT CTACCACCCT GCCCCGAGCT 
901 CGAACAGGGC CTTCTCTACC TGCCCCAGGA GCTCACCACC TGTGATAGTG 
951 TCGTAACATT TGAATTAACA GACATTGTGC ACTGCCGCAT GGCCGCCCCG 
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1001 AGCCAGCGCA AGGCCGTGCT GTCCACACTC GTGGGCCGCT ACGGCGTCGC 
1051 ACAAAGCTCT ACAATGCTTC CCACTCTGAT GTTCGCGACT CTCTCGCCCG 
1101 TTXTATCCCG GCCATTGGCC CCGTACAGGT TACAACTTGT GAATTGTACG 
1151 AGCTAGTGGA GGCCATGGTC GAGAAGGGCC AGGATGGCTC CGCCGTCCTT 
5 1201 GAGCTTGATC TTTGCAACCG TGACGTGTCC AGGATCACCT TCTTCCAGAA 

1251 AGATTGTAAC AAGTTCACCA CAGGTGAGAC CATTGCCCAT GGTAAAGTGG 
1301 GCCAGGGCAT CTCGGCCTGG AGCAAGACCT TCTGCGCCCT CTTTGGCCCT 
1351 TGGTTCCGCG CTATTGAGAA GGCTATTCTG GCCCTGCTCC CTCAGGGTGT 
1401 GTTTTACGGT GATGCCTTTG ATGACACCGT CTTCTCGGCG GCTGTGGCCG 
10 1451 CAGCAAAGGC ATCCATGGTG TTTGAGAATG ACTTTTCTGA GTTTGACTCC 

1501 ACCCAGAATA ACTTTTCTCT GGGTCTAGAG TGTGCTATTA TGGAGGAGTG 
1551 TGGGATGCCG CAGTGGCTCA TCCGCCTGTA TCACCTTATA AGGTCTGCGT 
1601 GGATCTTGCA GGCCCCGAAG GAGTCTCTGC GAGGGTTTTG GAAGAAACAC 
1651 TCCGGTGAGC CCGGCACTCT TCTATGGAAT ACTGTCTGGA ATATGGCCGT 
15 1701 TATTACCCAC TGTTATGACT TCCGCGATTT TCAGGTGGCT GCCTTTAAAG 

1751 GTGATGATTC GATAGTGCTT TGCAGTGAGT ATCGTCAGAG TCCAGGAGCT 
1801 GCTGTCCTGA TCGCCGGCTG TGGCTTGAAG TTGAAGGTAG ATTTCCGCCC 
1851 GATCGGTTTG TATGCAGGTG TTGTGGTGGC CCCCGGCCTT GGCGCGCTCC 
1901 CTGATGTTGT GCGCTTCGCC GGCCGGCTTA CCGAGAAGAA TTGGGGCCCT 
20 1951 GGCCCTGAGC GGGCGGAGCA GCTCCGCCTC GCTGTTAGTG ATTTCCTCCG 

2001 CAAGCTCACG AATGTAGCTC AGATGTGTGT GGATGTTGTT TCCCGTGTTT 
2051 ATGGGGTTTC CCCTGGACTC GTTCATAACC TGATTGGCAT GCTACAGGCT 
2101 GTTGCTGATG GCAAGGCACA TTTCACTGAG TCAGTAAAAC CAGTGCTCGA 
2151 CTTGACAAAT TCAATCTTGT GTCGGGTGGA ATGAATAACA TGTCTTTTGC 
25 2201 TGCGCCCATG GGTTCGCGAC CATGCGCCCT CGGCCTATTT TGTTGCTGCT 

2251 CCTCATGTTT TTGCCTATGC TGCCCGCGCC ACCGCCCGGT CAGCCGTCTG 
2301 GCCGCCGTCG TGGGCGGCGC AGCGGCGGTT CCGGCGGTGG TTTCTGGGGT 
2351 GACCGGGTTG ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA 
2401 CCCCTTCGCC CCCGATGTCA CCGCTGCGGC CGGGGCTGGA CCTCGTGTTC 
30 2451 GCCAACCCGC CCGACCACTC GCGTCCGCTT GGCGTGACCA GGCCCAGCGC 

2501 CCCGCCGTTG CCTCACGTCG TAGACCTACC ACAGCTGGGG CCGCGCCGCT 
2551 AACCGCGGTC GCTCCGGCCC CG 

The ability of the methods described herein to 
35 isolate and identify genetic material from other NANB 
hepatitis- strains has been confirmed by identifying 
genetic material from an isolate obtained in Mexico. 
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The sequence of this isolate was about 75% identical to 
the ET1.1 sequence set forth above. The sequence was 
identified by hybridization using the conditions set 
forth in Section II. B below. A partial cDNA sequence 
5 consisting of 1651 nucleotides is set forth below. 



Non-A Non-B ET: Mexican Strain 



1 GCGGCCGCTC GGTGGGGTTC 
10 51 TAAGCAAGGA ATTAATTCGC 

101 CAGATGCCAT TGTTAATAAT 
151 CAGAGACCAT CGGTGATTCC 
201 CTTGCGGCGT TTCCACCTTC 
250 TGAGGAGCTG GGCCACCGGC 
15 301 GAGCTTGAGC AGGGCCTTCT 

350 CAGTGTTGTG ACATTTGAGC 
401 CCCCTAGCCA AAGGAAAGCT 
451 AGACGCACAA GGCTTTATGA 
501 TGCGCGCTTT ATTCCCACTC 
20 551 TCTTTGAGCT TGTAGAGGCG 

601 GTCCTCGAGT TGGATTTGTG 
651 CCAGAAGGAT TGTAACAAGT 
701 AAGTCGGTCA GGGTATCTCC 
751 GGCCCCTGGT TCCGTTGCGA 
25 801 AAGCTGTGTT CTACGGGGAT 

851 GTGGCTGGCG CCAGCCATGC 
901 TGACTCGACT CAGAATAACT 
951 AAGAGTGTGG TATGCCCCAG 
1001 TCGGCGTGGA TCCTGCAGGC 
30 1051 GAAGCATTCT GGTGAGCCGG 

1101 TGGCAATCAT TGCCCATTGC 
1151 TTCAAGGGCG ACGACTCGGT 
1201 AGGCGCCGGT TCGCTTATAG 
1251 TCCGGCCGAT TGGGCTGTAT 
35 1301 GCCCTACCCG ATGTCGTTCG 

1351 GGGGCCTGAT CCGGAGCGGG 
1401 TCCTCCGTAG GTTAACGAAT 



CACCTCTCTT CCCATTCCGA ACAGAGAAGT 
GGCCGCTCGT GTTGCGTGAG GTGGGTATCT 
TTCTTCCTTT CGGGTGGCGA GGTTGGTCAC 
GCGAGCAACC CTGACCGCAA TGTTGACGTG 
ATGCCAAATA AGCGCCTTCC ATCAGCTTGC 
GGCGCCGGTG CTGTGCTACC TCCCTGCCCT 
CTATCTGCCA CAGGAGCTAG CCTCCTGTGA 
TAACTGACAT TGTGCACTGC CGCATGGCGG 
GTTTTGTCCA CGCTGGTAGG CCGGTATGGC 
TGCGGGTCAC ACCGATGTCC GCGCCTCCCT 
TCGGGCGGGT TACTGCCACC ACCTGTGAAC 
ATGGTGGAGA AGGGCCAAGA CGGTTCAGCC 
CAGCCGAGAT GTCTCCCGCA TAACCTTTTT 
TCACGACCGG CGAGACAATT GCGCATGGCA 
GCCTGGAGTA AGACCTTTTG TGCCCTGTTT 
TTGAGAAGGG TATTCTATCC CTTTTACCAC 
GCTTATGACG ACTCAGTATT CTCTGCTGCC 
CATGGTGTTT GAAAATGATT TTTCTGAGTT 
TTTCCCTAGG TCTTGAGTGC GCCATTATGG 
TGGCTTGTCA GGTTGTACCA TGCCGTCCGG 
CCCAAAAGAG TCTTTGAGAG GGTTCTGGAA 
GCACGTTGCT CTGGAATACG GTGTGGAACA 
TATGAGTTCC GGGACCTCCA GGTTGCCGCC 
CGTCCTCTGT AGTGAATACC GCCAGAGCCC 
CAGGCTGTGG TTTGAAGCCG AAGGCTGACT 
GCCGGGGTTG TCGTCGCCCC GGGGCTCGGG 
ATTCGCCGGA CGGCTTTCGG AGAAGAACTG 
CAGAGCAGCT CCGCCTCGCC GTGCAGGATT 
GTGGCCCAGA TTTGTGTTGA GGTGGTGTCT 
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1451 AGAGTTTACG GGGTTTCCCC GGGTCTGGTT CATAACCTGA TAGGCATGCT 
1501 CCAGACTATT GGTGATGGTA AGGCGCATTT TACAGAGTCT GTTAAGCCTA 
1551 TACTTGACCT TACACACTCA ATTATGCACC GGTCTGAATG AATAACATGT 
1601 GGTTTCCTGC GCCCATGGGT TCGCCACCAT GCGCCCTAGG CCTCTTTTGC 
5 1651 CGAGCGGCCG C 

When comparing the Burmese and Mexican 
strains, 76.7% identity is seen in a 1372 nucleotide 
overlap beginning at nucleotide 13 of the Mexican 

10 strain and nucleotide 331 of the Burmese strain. 

It will be recognized by one skilled in the 
art of molecular genetics that each of the last two 
sequences teaches a corresponding complementary DNA 
sequence as well as RNA sequences corresponding to both 

15 the principal sequence shown and the complementary DNA 
sequence. Additionally , open reading frames encoding 
peptides are present, and expressible peptides are 
disclosed by the nucleotide sequences without setting 
forth the amino acid sequences explicitly, in the same 

20 manner as if the amino acid sequences were explicitly 
set forth as in the ET1.1 sequence above. 



DETAILED DESCRIPTION OF THE INVENTION 
25 ~ 

I. Definitions 

The terms defined below have the following 
meaning herein: 

30 

1. "Enterically transmitted non-A/non-B (ET-NANB) 
hepatitis viral agent" means a virus, virus type, or 
virus class which (1) causes water-borne, infectious 
hepatitis, (ii) is transmissible in cynomolgus monkeys, 
35 (iii) is serologically distinct from hepatitis A virus 
(HAV) and hepatitis B virus (HAB), and (iv) includes a 
genomic region which is homologous to the 1.33 kb cDNA 



WO 89/12462 PCT/US89/02648 



10 



15 



20 



insert in plasmid pTZ-KFl(ETl.l) carried in e. coli 
strain BB4 identified by ATCC deposit number 67717. 

2. Two nucleic acid fragments are "homologous" if 
they are capable of hybridizing to one another under 
hybridization conditions in which hybridized strands 
contain at most about 25-30% basepair mismatches. In 
general, two single-strand nucleic acid species will be 
homologous if they hybridize under the conditions 
described in Maniatis et al. , op. cit. , pp. 320-323, 
but using the following wash conditions: 2 x SCC, 0.1% 
SDS, room temperature - twice, 30 minutes each; then 2 
x SCC, 0.1% SDS, 50°C - once, 30 minutes; then 2 x SCC, 
room temperature - twice, 10 minutes each. More 
preferably, homologous nucleic acid strands contain 15- 
25% basepair mismatches, even more preferably 5-15% 
basepair mismatches. These degrees of homology can be 
selected by using more stringent wash conditions for 
identification of clones from gene libraries (or other 
sources of genetic material) ,, as is well known in the 

20 art . 

3. A DNA fragment is "derived from" an ET-NANB 
viral agent if it has the same or substantially the 
same basepair sequence as a region of the viral agent 
genome . 

25 4. A protein is "derived from" an ET-NANB viral 

agent if it is encoded by an open reading frame of a 
DNA or RNA fragment derived from an ET-NANB viral 
agent . 



30 



35 



11 • Obtainin g Cloned ET-NANB Fragments 

According to one aspect of the invention, it 
has been found that a virus-specific DNA clone can be 
produced by (a) isolating RNA from, the bile of a 
cynomolgus monkey having a known ET-NANB infection, (b) 
cloning the cDNA fragments to form a fragment library, 
and (c) screening the library by differential 
hybridization to radiolabeled cDNAs from infected and 
non-infected bile sources. 
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A. cDNA Fragment Mixture 

ET-NANB infection in cynomolgus monkeys is 
initiated by inoculating the animals intravenously with 
5 a 10% w/v suspension from human case stools positive 
for 27-34 nm ET-NANB particles (mean diameter 32 nra). 
An infected animal is monitored for elevated levels of 
alanine aminotransferase, indicating hepatitis 
infection. ET-NANB infection is confirmed by 
10 imraunospecif ic binding of seropositive antibodies to 
virus-like particles (VLPs), according to published 
methods (Gravelle). Briefly, a stool (or bile) 
specimen taken from the infected animal 3-4 weeks after 
infection is diluted 1:10 with phosphate-buffered 
15 saline, and the 10% suspension is clarified by low- 
speed centrifugation and filtration successively 
through 1.2 and 0.45 micron filters. The material may 
be further purified by pelleting through a 30% sucrose 
cushion (Bradley). The resulting preparation of VLPs 
20 is mixed with diluted serum from human patients with 
known ET-NANB infection. After incubation overnight, 
the mixture is centrifuged overnight to pellet immune 
aggregates, and these are stained and examined by 
electron microscopy for antibody binding to the VLPs. 
25 ET-NANB infection can also be confirmed by 

seroconversion to VLP-positive serum. Here the serum 
of the infected animal is mixed as above with 27-34 nm 
VLPs isolated form the stool specimens of infected 
human cases and examined by immune electron microscopy 
30 for antibody binding to the VLPs. 

Bile can be collected from ET-NANB positive 
animals by either cannulating the bile duct and 
collecting the bile fluid or by draining the bile duct 
during necropsy. Total RNA is extracted from the bile 
35 by hot phenol extraction, as outlined in Example 1A. 
The RNA fragments are used to synthesize corresponding 
duplex cDNA fragments by random priming, also as 
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referenced in Example 1A. The cDNA fragments may be 
fractionated by gel electrophoresis or density gradient 
centrifugation to obtain a desired size class of 
fragments, e.g., 500-4,000 basepair fragments. 
5 Although alternative sources of viral material, 

such as VLPs obtained from stool samples (as described 
in Example 4), may be used for producing a cDNA 
fraction, the bile source is preferred. According to 
one aspect of the invention, it has been found that 

10 bile from ET-NANB-infected monkeys shows a greater 

number of intact viral particles than material obtained 
from stool samples, as evidenced by immune electron 
microscopy. Bile obtained from an ET-NANB infected 
human or cynomolgus monkey, for use as a source of ET- 

15 NANB viral protein or genomic material, or intact 
virus, forms part of the present invention. 

B. cDNA Library and Screening 

The cDNA fragments from above are cloned into a 
suitable cloning vector to form a cDNA library. This 
may be done by equipping blunt-ended fragments with a 
suitable end linker, such as an EcoRI sequence, and 
inserting the fragments into a suitable insertion site 
of a cloning vector, such as at a unique EcoRI site. 
After initial cloning, the library may be recloned, if 
desired, to increase the percentage of vectors 
containing a fragment insert. The library construction 
described in Example IB is illustrative. Here cDNA 
fragments were blunt-ended, equipped with EcoRI ends, 
and inserted into the EcoRI site of the lambda phage 
vector gtlO. The library phage, which showed less than 
5% fragment inserts, was isolated, and the fragment 
inserts recloned into the lambda gtlO vector, yielding 
more than 95% insert-containing phage. 

The cDNA library is screened for sequences 
specific for ET-NANB by differential hybridization to 
cDNA probes derived from infected and non-infected 
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sources. cDNA fragments from infected and non-infected 
source bile or stool viral isolates can be prepared as 
above. Radiolabeling the fragments is by random 
labeling, nick translation, or end labeling, according 
5 to conventional methods (Maniatis, p. 109). The cDNA 
library from above is screened by transfer to duplicate 
nitrocellulose filters, and hybridization with both 
infected-source and non-infected-source (control) 
radiolabeled probes, as detailed in Example 2. in 
order to recover sequences that hybridize at the 
preferred outer limit of 25-30% basepair mismatches, 
clones can be selected if they hybridize under the 
conditions described in Maniatis et al. , op. cit. , pp. 
320-323, but using the following wash conditions: 2 x 
SCC, 0.1% SDS, room temperature - twice, 30 minutes 
each; then 2 x SCC, 0.1% SDS, 50°C - once, 30 minutes; 
then 2 x SCC, room temperature - twice, 10 minutes 
each. These conditions allowed identification of the 
Mexican isolate discussed above using the ET1.1 
sequence as a probe. Plaques which show selective 
hybridization to the infected-source probes are 
preferably replated at low plating density and 
rescreened as above, to isolate single clones which are 
specific for ET-NANB sequences. As indicated in 
Example 2, sixteen clones which hybridized specifically 
with infected-source probes were indentified by these 
procedures. One of the clones, designated lambda gtlO- 
1.1, contained a 1.33 kilobase fragment insert. 

C. ET-NANB Sequences 

The basepair sequence of cloned regions of the 
ET-NANB fragments from Part B are determined by 
standard sequencing methods. In one illustrative 
method, described in Example 3, the fragment insert 
from the selected cloning vector is excised, isolated 
by gel electrophoresis, and inserted into a cloning 
vector whose basepair sequence on either side of the 
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insertion site is known. The particular vector 
employed in Example 3 is a pTZ-KFl vector shown at the 
left in Figure 1. The ET-NANB fragment from the gtlO- 
1.1 phage was inserted at the unique EcoRi site of the 
5 pTZ-KFl plasmid. Recombinants carrying the desired 
insert were identified by hybridization with the 
isolated 1.33 kilobase fragment, as described in 
Example 3. One selected plasmid, identified as pTZ-KFl 
(ET1.1), gave the expected 1.33 kb fragment after 
10 vector digestion with EcoRI . E. coli strain BB4 
infected with the pTZ-KFl(ETl.l) plasmid has been 
deposited with the American Type Culture Collection, 
Rockville, MD, and is identified by ATCC deposit nubmer 
67717. 

15 The PTZ-KFI(ETI.I) plasmid is illustrated at 

the bottom in Figure 1. The fragment insert has 5' and 
3' end regions denoted at A and C, respectively, and an 
intermediate region,' denoted at B. The sequences in 
these regions were determined by standard dideoxy 
sequencing and are set forth above. The three short 
sequences (A, B, and C) are from the same insert 
strand. As will be seen in Example 3, the B-region 
sequence was actually determined from the opposite 
strand, so that the B-region sequence shown above 
25 represent the complement of the sequence in the 

sequenced strand. The base numbers of the partial 
sequences are approximate. 

Later work in the laboratory of the inventors 
identified the full sequence, also set forth above. 
Fragments of this total sequence can readily be 
prepared using restriction endonucleases . Computer 
analysis of both the forward and reverse sequence has 
identified a number of cleavage sites. The specific 
cleavage sites are summarized (for the forward 
35 direction) in the following tables. 
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Pattern identifier Pattern matched Base number matched 

(* = cleavage site) i n "forward" strand 



5 


CC*WGG(BstNI) 


CCWGG 


106, 


360, 


410, 


483, 








497, 


973, 


1129, 


1243 

^fe mm «J 




CC*SGG(NciI) 


CCSGG 


288, 


841, 


1063 






GAAGANNNNNNNN* ( Mbol I ) 


GAAGA 


822, 


1116 








TCTTC(<-7-MboII) 


TCTTC 


422, 


611, 


849 




10 


GCGTC(<-10-HgaI) 


GCGTC 


225 










GCATCNNNNN* ( Sf aNI ) 


GCATC 


488, 


640 








GCAGCNNNNNNNN* ( Bbvl ) 


GCAGC 


53, 


631, 


1149 






GCTGC(<-12-BbvI) 


GCTGC 


919, 


979 








GGATGNNNNNNNNN * ( Fok I ) 


GGATG 


363, 


734, 


1212 




15 


CATCC(<-13-PokI) 


CATCC 


641, 


750 








GGTGANNNNNNNN* ( HphI ) 


GGTGA 


454, 


589, 


835, 


931 




TCACC(<-7-HphI) 


TCACC 


114, 


416, 


446, 


762 




GP*CGYC(AhaII) 


GPCGYC 


224 










GDGCH*C(BspI1286) 


GDGCHC 


77, 


110, 


158, 


838, 


20 






1125, 


1324 








GPGCY*C(BanII) 


GPGCYC 


77, 


110, 


838, 


1125 




C*YCGPG(AvaI) 


CYCGPG 


74, 


178 








Y*GGCCP(EaeI) 


YGGCCP 


171, 
1101 


290, 


626, 


875 r 


25 


GWGCW*C(GgiAI) 


GWGCWC 


77, 


110, 


158 . 

• WW, 


1324 

M W *9 




C*CTTGG(StyI) 


CCTTGG 


529, 


1068 








P*GATCY(XhoII) 


PGATCY 


782 










CAG*CTG(PvuII) 


CAGCTG 


54 










C*CATGG ( Ncol ) 


CCATGG 


344, 


468, 


644 




30 


CGAT*CG ( Pvul ) 
C*GGCCG(EagI) 


CGATCG 
CGGCCG 


1031 
1101 










G* AATTC ( EcoRl ) 


GAATTC 


2, 1335 








GAGCT*C ( SacI ) 


GAGCTC 


77, 


110 








GCATG*C(SphI) 


GCATGC 


1268 








35 


GCC*GGC(NaeI) 

G*CGCGC(BssHII) 

GGGCC*C(ApaI) 


GCCGGC 
GCGCGC 
GGGCCC 


994, 
1073 
1125 


1099, 


1103 
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TCG*CGA(NruI) 


TCGCGA 


264 


T*CTAGA ( Xbal ) 


TCTAGA 


705 


TTT*AAA(DraI) 


TTTAAA 


• 925 


G*TGCAC ( ApaLI ) 


GTGCAC 


158 


ACCTGCNNNN* ( BpsMI ) 


ACCTGC 


99 


GCAGGT ( <-8-BspMI ) 


GCAGGT 


1045 


GACN*NNGTC ( Tthllll ) 


GACNNNGTC 


604 


CCANNNN*NTGG ( Pf 1MI ) 


CCANNNNNTGG 


10 


CC*TNAGG(MstII) 


CCTNAGG 


571 


GCCNNNN*NGGC ( Bgll ) 


GCCNNNNNGGC 


216, 


CCANNNNN * NTGG ( Bs tXI ) 


CCANNHNNNTGG 


204 



359, 



738 



20 



III. ET-NANB Fragments 
15 According to another aspect, the invention 

includes ET-NANB-specific fragments or probes which 
hybridize with ET-NANB genomic sequences or cDNA 
fragments derived therefrom. The fragments may include 
full-length cDNA fragments such as described in Section 
II, or may be derived from shorter sequence regions 
within cloned cDNA fragments. Shorter fragments can be 
prepared by enzymatic digestion of full-length 
fragments under conditions which yield desired-sized 
fragments, as will be described in Section IV. 
25 Alternatively, the fragments can be produced by 

oligonucleotide synthetic methods, using sequences 
derived from the cDNA fragments. Methods or services 
for producing selected-sequence oligonucleotide 
fragments are available. 

30 To confirm that a given ET-NANB fragment is in 

fact derived from the ET-NANB viral agent, the fragment 
can be shown to hybridize selectively with cDNA from 
infected sources. By way of illustration, to confirm 
that the 1.33 kb fragment in the pTZ-KFl(El.l) plasmid 

35 is ET-NANB in origin, the fragment was excised from the 
pTZ-KPl(ETl.l) plasmid, purified, and radiolabeled by 
random labeling. The radiolabeled fragment was 
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hybridized with fractionated cDNAs from infected and 
non-infected sources to confirm that the probe reacts 
only with infected-source cDNAs. This method is 
illustrated in Example 4, where the above radiolabeled 
5 1.33 kb fragment from pTZ-KFl(ETl.l) plasmid was 

examined for binding to cDNAs prepared from infected 
and non-infected sources. The infected sources are (1) 
bile from a cynomolgus monkey infected with a strain of 
virus derived from stool samples from human patients 
10 from Burma with known ET-NANB infections and (2) a * 
viral agent derived from the stool sample of a human 
ET-NANB patient from Mexico. The cDNAs in each 
fragment mixture were first amplified by a 
linker/primer amplification method described in Example 
15 4. Fragment separation was on agarose gel, followed by 
Southern blotting and then hybridization to bind the 
radiolabeled 1.33 kb fragment to the fractionated 
cDNAs. The lane containing cDNAs from the infected 
sources showed a smeared band of bound probe , as 
20 expected (cDNAs amplified by the linker/primer 

amplification method would be expected to have a broad 
range of sizes). No probe binding to the amplified 
cDNAs from the non-infected sources was observed,. The 
results indicate that the 1.33 kb probe is specific for 
25 cDMA fragments associated with ET-NANB infection. This 
same type of study, using ET 1.1 as the probe, has 
demonstrated hybridization to ET-NANB samples collected 
from Tashkent, Somalia, Borneo and Pakistan. Secondly, 
the fact that the probe is specific for ET-NANB related 
30 sequences derived from different continents (Asia, 

Africa and North America) indicates the cloned ET-NANB 
Africa sequence is derived from a common ET-NANB virus 
or virus class responsible for ET-NANB hepatitis 
infection worldwide. 
35 In a related confirmatory study, probe binding 

to fractionated genomic fragments prepared from human 
or cynomolgus monkey genomic DNA was examined. No 
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probe binding was observed to either genomic fraction, 
demonstrating that the ET-NANB fragment is not an 
endogenous human or cynomologus fragment. 

Another confirmation of ET-NANB specific 
5 sequences in the fragments is the ability to express 
ET-NANB proteins from coding regions in the 
fragments. Section IV below discusses methods of 
protein expression using the fragments. 

One important use of the ET-NANB-specif ic 

10 fragments is for identifying ET-NANB-derived cONAs 
which contain additional sequence information. The 
newly identified cDNAs, in turn, yield new fragment 
probes, allowing further iterations until the entire 
viral genome is identified and sequenced. Procedures 

15 for identifying additional ET-NANB library clones and 
generating new probes therefrom generally follow the 
cloning and selection procedures described in Section 
II. 

The fragments (and oligonucleotides prepared 
based on the sequences given above) are also useful as 
primers for a polymerase chain reaction method of 
detecting ET-NANB viral genomic material in a patient 
sample. This diagnostic method will be described in 
Section V below . 

25 

IV. ET-NANB Proteins 

As indicated above, ET-NANB proteins can be 
prepared by expressing open reading-frame coding 
regions in ET-NANB fragments. In one. preferred 
approach, the ET-NANB fragments used for protein 
expression are derived from cloned cDNAs which have 
been treated to produce desired-size fragments, and 
preferably random fragments with sizes predominantly 
35 between about 100 to about 300 base pairs. Example 5 
describes the preparation of such fragments by DNAs 
digestion. Because it' is desired to obtain peptide 
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antigens of between about 30 to about 100 amino acids, 
the digest fragments are preferably size fractionated, 
for example by gel electrophoresis, to select those in 
the approximately 100-300 basepair size range. 

5 

A. Expression Vector 

The ET-NANB fragments are inserted into a 
suitable expression vector. One exemplary expression 
vector is lambda gtll, which contains a unique EcoRl 

10 insertion site 53 base pairs upstream of the 

translation termination codon of the beta-galactosidase 
gene. Thus, the inserted sequence will be expressed as 
a beta-galactosidase fusion protein which contains the 
N- terminal portion of the beta-galactosidase gene, the 

15 heterologous peptide, and optionally the C-terminal 
region of the beta-galactosidase peptide (the C- 
terminal portion being expressed when the heterologous 
peptide coding sequence does not contain a translation 
termination codon) . This vector also produces a 

20 temperature-sensitive repressor (cl857) which causes 
viral lysogeny at permissive temperatures, e.g., 32°C, 
and leads to viral lysis at elevated temperatures, 
e.g., 42°C. Advantages of this vector include: (1) 
highly efficient recombinant generation, (2) ability to 

25 select lysogenized host cells on the basis of host-cell 
growth at permissive, but not non-permissive, 
temperatures, and (3) high levels of recombinant fusion 
protein production. Further, since phage containing a 
heterologous insert produces an inactive beta- 

30 galactosidase enzyme, phage with inserts can be readily 

identified by a beta-galactosidase colored-substrate 
reaction. 

For insertion into the expression vector, the 
viral digest fragments may be modified, if needed, to 
J5 contain selected restriction-site linkers, such as 
EcoRI linkers, according to conventional procedures. 
Example 1 illustrates methods for cloning the digest 
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fragments into lambda gtll, which includes the steps of 
blunt-ending the fragments,, ligating with EcoRI 
linkers, and introducing the fragments into EcoRI-cut 
lambda gtll. The resulting viral genomic library may 
5 be checked to confirm that a relatively large 

(representative) library has been produced. This can 
be done, in the case of the lambda gtll vector, by 
infecting a suitable bacterial host, plating the 
bacteria, and examining the plaques for loss of beta- 
10 galactosidase activity. Using the procedures described 
in Example 1, about 50% of the plaques showed loss of 
enzyme activity. 

B. Peptide Antigen Expression 
15 The viral genomic library formed above is 

screened for production of peptide antigen (expressed 
as a fusion protein) which is immunoreactive with 
antiserum from ET-NANB seropositive individuals. In a 
preferred screening method, host cells infected with 
phage library vectors are plated, as above, and the 
plate is blotted with a nitrocellulose filter to 
transfer recombinant protein antigens produced by the 
cells onto the filter. The filter is then reacted with 
the ET-NANB antiserum, washed to remove unbound 
25 antibody, and reacted with reporter-labeled, anti-human 
antibody, which becomes bound to the filter, in 
sandwich fashion, through the anti-ET— NANB antibody. 

Typically phage plaques which are identified 
by virtue of their production of recombinant antigen of 
interest are re-examined at a relatively low density 
for production of antibody-reactive fusion protein. 
Several recombinant phage clones which produced 
immunoreactive recombinant antigen were identified in 
the procedure. 

The selected expression vectors may be used for 
scale-up production, for purposes of recombinant 
protein purification. Scale-up production is carried 
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out using one of a variety of reported methods for (a) 
lysogenizing a suitable host, such as E. coli , with a 
selected lambda gtll recombinant (b) culturing the 
transduced cells under conditions that yield high 
5 levels of the heterologous peptide, and (c) purifying 
the recombinant antigen frpm the lysed cells. 

In one preferred method involving the above 
lambda gtll cloning vector, a high-producer E. coli 
host, BNN103, is infected with the selected library 
10 phage and replica plated on two plates. One of the 
plates is grown at 32°C, at which viral lysogeny can 
occur, and the other at 42°C, at which the infecting 
phage is in a lytic stage and therefore prevents cell 
growth. Cells which grow at the lower but not the 
15 higher temperature are therefore assumed to be 
successfully lysogenized. 

The lysogenized host cells are then grown under 
liquid culture conditions which favor high production 
of the fused protein containing the viral insert, and 
20 lysed by rapid freezing to release the desired fusion 
protein. 

C. Peptide Purification 

The recombinant peptide can be purified by 

25 standard protein purification procedures which may 
include differential precipitation, molecular sieve 
chromatography, ion-exchange chromatography, 
isoelectric focusing, gel electrophoresis and affinity 
chromatography. In the case of a fused protein, such 

30 as the beta-galactosidase fused protein prepared as 

above, the protein isolation techniques which are used 
can be adapted from those used in isolation of the 
native protein. Thus, for isolation of a beta- 
glactosidase fusion protein, the protein can be 

35 isolated readily by simple affinity chromatography, by 
passing the cell lysis material over a solid support 
having surface-bound anti-beta-galactosidase antibody. 
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D. Viral Proteins 

The ET-NANB protein of the invention may also 
be derived directly from the ET-NANB viral agent. VLPs 
isolated from a stool sample from an infected 
individual, as above, are one suitable source of viral 
protein material. The VLPs isolated from the stool 
sample may be further purified by affinity 
chromatography prior to protein isolation (see 
below). The viral agent may also be raised in cell 
culture, which provides a convenient and potentially 
concentrated source of viral protein. Co-owned a.S. 
Patent Application Serial No. 846,757, filed April 1, 
1986, describes an immortalized trioma liver cell which 
5 supports NANB infection in cell culture. The trioma 
cell line is prepared by fusing human liver cells with 
a mouse/human fusion partner selected for human 
chromosome stability! Cells containing the desired 
NANB viral agent can be identified by immuno- 
fluorescence methods, employing anti— ET-NANB human 
antibodies . 

The viral agent is disrupted, prior to protein 
isolation, by conventional methods, which can include 
sonication, high- or low-salt conditions, or use of 
detergents. 

Purification of ET-NANB viral protein can be 
carried out by affinity chromatography, using a 
purified anti-ET-NANB antibody attached according to 
standard methods to a suitable solid support. The 
antibody itself may be purified by affinity 
chromatography, where an immunoreactive recombinant 
ETNANB protein, such as described above, is attached to 
a solid support, for isolation of anti— ET-NANB 
antibodies from an immune serum source. The bound 
antibody is released from the support by standard 
methods . 
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Alternatively, the anti-ET-NANB antibody may be 
a monoclonal antibody (Mab) prepared by immunizing a 
mouse or other animal with recombinant ET-NANB protein, 
isolating lymphocytes from the animal and immortalizing 
5 the cells with a suitable fusion partner, and selecting 
successful fusion products which react with the 
recombinant protein immunogen. These in turn may be 
used in affinity purification procedures, described 
above, to obtain native ET-NANB antigen. 

10 

V. Utility 



A. Diagnostic Methods 

The particles and antigens of the invention,, as 

15 well as the genetic material, can be used in diagnostic 
assays. Methods for detecting the presence of ET-NANB 
hepatitis comprise analyzing a biological sample such 
as a blood sample, stool sample or liver biopsy 
specimen for the presence of an analyte associated with 

20 ET-NANB hepatitis virus. 

The analyte can be a nucleotide sequence which 
hybridizes with a probe comprising a sequence of at 
least about 16 consecutive nucleotides, usually 30 to 
200 nucleotides, up to substantially the full sequence 

25 of the sequences shown above (cDNA sequences). The 

analyte can be RNA or cDNA. The analyte is typically a 
virus particle suspected of being ET-NANB or a particle 
for which this classification is being ruled out. The 
virus particle can be further characterized as having 

30 an RNA viral genome comprising a sequence at least 
about 80% homologous to a sequence of at least 12 
consecutive nucleotides of the "forward' 1 and "reverse" 
sequences given above, usually at least about 90% 
homologous to at least about 60 consecutive nucleotides 

35 within the sequences, and may comprise a sequence 
substantially homologous to the full-length 
sequences. In order to detect an analyte, where the 
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analyte hybridizes to a probe, the probe may contain a 
detectable label. 

• The analyte can also comprise an antibody which 
recognizes an antigen, such as a cell surface antigen, 
on a ET-NANB virus particle. The analyte can also be a 
ET-NANB viral antigen. Where the analyte is an 
antibody or an antigen, either a labelled antigen or 
antibody, respectively, can be used to bind to the 
analyte to form an immunological complex, which can 
then be detected by means of the label. 

Typically, methods for detecting analytes such 
as surface antigens and/or whole particles are based on 
immunoassays. Immunoassays can be conducted either to 
determine the presence of antibodies in the host that 
15 have arisen from infection by ET-NANB hepatitis virus 
or by assays that directly determine the presence of 
virus particles or antigens. Such techniques are well 
known and need not be described here in detail. Exam- 
ples include both heterogeneous and homogeneous immuno- 
assay techniques. Both techniques are based on the 
formation of an immunological complex between the virus 
particle or its antigen and a corresponding specific 
antibody. Heterogeneous assays for viral antigens 
typically use a specific monoclonal or polyclonal 
25 antibody bound to a solid surface. Sandwich assays are 
becoming increasingly popular. Homogeneous assays, 
which are carried out in solution without the presence 
of a solid phase, can also be used, for example by 
determining the difference in enzyme activity brought 
on by binding of free antibody to an enzyme-antigen 
conjugate. A number of suitable assays are disclosed 
in U.S. Patent Nos. 3,817,837, 4,006,360, 3,996,345. 

When assaying for the presence of : antibodies 
induced by ET-NANB viruses, the viruses and antigens of 
35 the invention can be used as specific binding agents to 
detect either IgG or IgM antibodies, since IgM anti- 
bodies are typically the first antibodies that appear 
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during the course of an infection, when IgG synthesis 
may not yet have been initiated, specifically distin- 
guishing between IgM and IgG antibodies present in the 
blood stream of a host will enable a physician or other 
5 investigator to determine whether the infection is 
recent or chronic. 

In one diagnostic configuration, test serum is 
reacted with a solid phase reagent having surface-bound 
ET-NANB protein antigen. After binding anti-ET-NANB 
10 antibody to the reagent and removing unbound serum 
components by washing, the reagent is reacted with 
reporter-labeled anti-human antibody to bind reporter 
to the reagent in proportion to the amount of bound 
anti-ET-NANB antibody on the solid support. The 
15 reagent is again washed to remove unbound labeled 

antibody, and the amount of reporter associated with 
the reagent is determined. Typically, the reporter is 
an enzyme which is detected by incubating the solid 
phase in the presence of a suitable fluorometric or 
20 color imetric substrate. 

The solid surface reagent in the above assay 
prepared by known techniques for attaching protein 
material to solid support material, such as polymeric 
beads, dip sticks, or filter material. These 
25 attachment methods generally include non-specific 

adsorption of the protein to the support or covalent 
attachment of the protein, typically through a free 
amine group, to a chemically reactive group on the 
solid support, such as an activate carboxyl, hydroxyl, 
30 or aldehyde group. 

In a second diagnostic configuration, known as 
a homogeneous assay, antibody binding to a solid 
support produces some change in the reaction medium 
which can be directly detected in the medium. Known 
35 general types of homogeneous assays proposed heretofore 
include (a) spin-labeled reporters, where antibody 
binding to the antigen is detected by a change in 
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reported mobility (broadening of the spin splitting 
peaks), (b) fluorescent reporters, where binding is 
detected by a change in fluorescence efficiency, ( C ) 
enzyme reporters, where antibody binding effects 
5 enzyme/substrate interactions, and (d) liposome-bound 
reporters, where binding leads to liposome lysis and 
release of encapsulated reporter. The adaptation of 
these methods to the protein antigen of the present 
invention follows conventional methods for preparing 
10 homogeneous assay reagents. 

In each of the assays described above, the 
assay method involves reacting the serum from a test 
individual with the protein antigen and examining the 
antigen for the presence of bound antibody. The 
examining may involve attaching a labeled anti-human 
antibody to the antibody being examined, either IgM 
(acute phase) or IgG (convalescent phase), and 
measuring the amount of reporter bound to the solid 
support, as in the first method, or may involve 
observing the effect of antibody binding on a 
homogeneous assay reagent, as in the second method. 

Also forming part of the invention is an assay 
system or kit for carrying out the assay method just 
described. The kit generally includes a support with 
surface-bound recombinant protein antigen which is (a) 
immunoreactive with antibodies present in individuals 
infected with enterically transmitted nonA/nonB viral 
agent and (b) derived from a viral hepatitis agent 
whose genome contains a region which is homologous to 
the 1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KPl(ETl.l) carried in E. Coll strain BB4, and having 
ATCC deposit no. 67717. A reporter-labeled anti-human 
antibody in the kit is used for detecting surface-bound 
ant i-ET-NANB antibody. 
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B. Viral Genome Diagnostic Applications 

The genetic material of the invention can it- 
self be used in numerous assays as probes for genetic 
material present in naturally occurring infections. 
5 One method for amplification of target nucleic acids, 
for later analysis by hybridization assays, is known as 
the polymerase chain reaction or PCR technique. The 
PCR technique can be applied to detecting virus 
particles- of the invention in suspected pathological 
10 samples using oligonucleotide primers spaced apart from 
each other and based on the genetic sequence set forth 
above. The primers are complementary to opposite 
strands of a double stranded DNA molecule and are 
typically separated by from about 50 to 450 nt or more 
15 (usually not more than 2000 nt). This method entails 
preparing the specific oligonucleotide primers and then 
repeated cycles of target DNA denaturation, primer 
binding , and extension with a DNA polymerase to obtain 
DNA fragments of the expected length based on the 
20 primer spacing. Extension products generated from one 
primer serve as additional target sequences for the 
other primer. The degree of amplification of a target 
sequence is controlled by the number of cycles that are 
performed and is theoretically calculated by the simple 
25 formula 2 n where n is the number of cycles. Given that 
the average efficiency per cycle ranges from about 65% 
to 85%, 25 cycles produce from 0.3 to 4.8 million 
copies of the target sequence. The PCR method is de- 
scribed in a number of publications, including Saiki et 
30 al., Science (1985) 230:1350-1354; Saiki et al., Nature 
(1986) 321 s163 " 166 ? and Scharf et al., Science (1986) 
233:1076-1078. Also see U.S. Patent Nos. 4,683,194; 
4,683,195; and 4,683,202. 

The invention includes a specific diagnostic 
35 method for determination of ET-NANB viral agent, based 
on selective amplification of ET-NANB fragments. This 
method employs a pair of single-strand primers derived 
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from non-homologous regions of opposite strands of a 
DNA duplex fragment, which in turn is derived from an 
enterically transmitted viral hepatitis agent whose 
genome contains a region which is homologous to the 
1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KFl(ETl.l) carried in E. coli strain BB4, and having 
ATCC deposit no. 57717. These "primer fragments," 
which form one aspect of the invention, are prepared 
from ET-NANB fragments such as described in Section III 
above. The method follows the process for amplifying 
selected nucleic acid sequences as disclosed in U.S. 
Patent No. 4,683,202, as discussed above. 

C Peptide Vaccine 

Any of the antigens of the invention can be 
used in preparation of a vaccine. A preferred starting 
material for preparation of a vaccine is the particle 
antigen isolated from bile. The antigens are 
preferably initially recovered as intact particles as 
described above. However, it is also possible to pre- 
pare a suitable vaccine from particles isolated from 
other sources or non-particle recombinant antigens. 
When non-particle antigens are used {typically soluble 
antigens), proteins derived from the viral envelope or 
viral capsid are preferred for use in preparing vac- 
cines. These proteins can be purified by affinity 
chromatography, also described above. 

If the purified protein is not immunogenic per 
se, it can be bound to a carrier to make the protein 
immunogenic. Carriers include bovine serum albumin, 
keyhole limpet hemocyanin and the like. It is desir- 
able, but not necessary, to purify antigens to be sub- 
stantially free of human protein. However, it is more 
important that the antigens be free of proteins, 
viruses, and other substances not of human origin that 
may have been introduced by way of. or contamination 
of, the nutrient medium, cell lines, tissues, or patho- 
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logical fluids from which the virus is cultured or 
obtained. 

Vaccination can be conducted in conventional 
fashion. For example, the antigen, whether a viral 
5 particle or a protein, can be used in a suitable 
diluent such as water, saline, buffered salines, 
complete or incomplete adjuvants, and the like. The 
iramunogen is administered using standard techniques for 
antibody induction, such as by subcutaneous administra- 

10 tion of physiologically compatible, sterile solutions 
containing inactivated or attenuated virus particles or 
antigens. An immune response producing amount of virus 
particles is typically administered per vaccinizing 
injection, typically in a volume of one milliliter or 

15 less. 

A specific example of a vaccine composition 
includes, in a pharmacologically acceptable adjuvant , a 
recombinant protein or protein mixture derived from an 
enterically transmitted nonA/nonB viral hepatitis agent 

20 whose genome contains a region which is homologous to 
the 1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KPl(ETl.l) carried in E. coli strain BB4, and having 
ATCC deposit no. 67717. The vaccine is administered at 
periodic intervals until a significant titer of anti- 

25 ET-NANB antibody is detected in the serum. The vaccine 
is intended to protect against ET-NANB infection. 

m 

Prophylactic and Therapeutic 
Antibodies and Antisera 

30 In addition to use as a vaccine, the composi- 

tions can be used to prepare antibodies to ET-NANB 
virus particles. The antibodies can be used directly 
as antiviral agents. To prepare antibodies, a host 
animal is immunized using the virus particles or, as 

35 appropriate, non-particle antigens native to the virus 
particle are bound to a carrier as described above for 
vaccines. The host serum or plasma is collected 
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following an appropriate time interval to provide a 
composition comprising antibodies reactive with the 
virus particle. The gamma globulin fraction or the IgG 
antibodies can be obtained, for example, by use of 
5 saturated ammonium sulfate or DEAE Sephadex, or other 
techniques known to those skilled in the art. The 
antibodies are substantially free of many of the 
adverse side effects which may be associated with other 
anti-viral agents such as drugs. 
10 Tne antibody compositions can be made even more 

compatible with the host system, by minimizing potential 
adverse immune system responses. This is accomplished 
by removing all or a portion of the Pc portion of a 
foreign species antibody or using an antibody of the 
15 same species as the host animal, for example, the use 
of antibodies from human/human hybridomas. 

The antibodies can also be used as a means of 
enhancing the immune response since antibody-virus 
complexes are recognized by macrophages. The anti- 
20 bodies can be administered in amounts similar to those 
used for other therapeutic administrations of anti- 
body. For example, pooled gamma globulin is admini- 
stered at 0.02-0.1 ml/lb body weight during the early 
incubation of other viral diseases such as rabies, 
measles and hepatitis B to interfere with viral entry 
into cells. Thus, antibodies reactive with the ET-NANB 
virus particle can be passively administered alone or 
in conjunction with another anti-viral agent to a host 
infected with an ET-NANB virus to enhance the immune 
response and/or the effectiveness of an antiviral drug. 

Alternatively, anti-ET-NANB-virus antibodies 
can be induced by administering anti-idiotype anti- 
bodies as immunogens. Conveniently, a purified antiET- 
NANB-virus antibody preparation prepared as descibed 
above is used to induce anti-idiotype antibody in a 
host animal. The composition is administered to the 
host animal in a suitable diluent. Following 
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administration, usually repeated administration, the 
host produces anti-idiotype antibody. To eliminate an 
immunogenic response to the Pc region, antibodies pro- 
duced by the same species as the host animal can be 
5 used or the Pc region of the administered antibodies 
can be removed. Following induction of anti-idiotype 
antibody in the host animal, serum or plasma is removed 
to provide an antibody composition. The composition 
can be purified as described above for anti-ET-NANB- 
10 virus antibodies, or by affinity chromatography using 
an ti-ET-NANB- virus antibodies bound to the affinity 
matrix. The anti-idiotype antibodies produced are 
similar in conformation to the authentic ET-NANB 
antigen and may be used to prepare an ET-NANB vaccine 
15 rather than using a ET-NANB particle antigen. 

When used as a means of inducing an t i -ET-NANB- 
virus antibodies in a patient, the manner of injecting 
the antibody is the same as for vaccination purposes, 
namely intramuscularly, intraperitoneally r subcutane- 
ously or the like in an effective concentration in a 
physiologically suitable diluent with or without adju- 
vant. One or more booster injections may be desirable. 
The anti-idiotype method of induction of an t i -ET-NANB- 
virus antibodies can alleviate problems which may be 
25 caused by passive administration of an ti-ET-NANB- virus 
antibodies, such as an adverse immune response, and 
those associated with administration of purified blood 
components, such as infection with as yet undiscovered 
viruses. 

30 The ET-NANB derived proteins of the invention 

are also intended for use in producing antiserum 
designed for pre- or post-exposure prophylaxis. Here 
an ET-NANB protein, or mixture of proteins is 
formulated with a suitable adjuvant and administered by 

35 injection to human volunteers, according to known 
methods for producing human antisera. Antibody 
response to the injected proteins is monitored, during 
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a several- week period following immunization, by 
periodic serum sampling to detect the presence an anti- 
ET-NANB serum antibodies, as described in Section IIA 
above . 

The antiserum from immunized individuals may be 
administered as a pre-exposure prophylactic measure for 
individuals who are at risk of contracting infection. 
The anitserum is also useful in treating an individual 
post-exposure, analogous to the use of high titer 
antiserum against hepatitis B virus for post-exposure 
prophylaxis . 

E. Monoclonal Antibodies 

For both in vivo use of antibodies to ET-NANB 
virus particles and proteins and anti-idiotype anti- 
bodies and diagnostic use, it may be preferable to use 
monoclonal antibodies. Monoclonal anti-virus particle 
antibodies or anti-idiotype antibodies can be produced 
as follows. The spleen or lymphocytes from an 
immunized animal are removed and immortalized or used 
to prepare hybridomas by methods known to those skilled 
in the art. To produce a human-human hybridoma, a 
human lymphocyte donor is selected. A donor known to 
be infected with a ET-NANB virus (where infection has 
been shown for example by the presence of anti-virus 
antibodies in the blood or by virus culture) may serve 
as a suitable lymphocyte donor. Lymphocytes can be 
isolated from a peripheral blood sample or spleen cells 
may be used if the donor is subject to splenectomy. 
Epstein-Barr virus (EBV) can be used to immortalize 
human lymphocytes or a human fusion partner can be used 
to produce human-human hybridomas. Primary in vitro 
immunization with peptides can also be used in the 
generation of human monoclonal antibodies. 

Antibodies secreted by the immortalized cells 
are screened to determine the clones that secrete anti- 
bodies of the desired specificity. For monoclonal 
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anti-virus particle antibodies, the antibodies must 
bind to ET-NANB virus particles. For monoclonal anti- 
idiotype antibodies, the antibodies must bind to anti- 
virus particle antibodies. Cells producing antibodies 
5 of the desired specificity are selected. 

The following examples illustrate various 
aspects of the invention, but are in no way intended to 
limit the scope thereof. 

10 Material 

The materials used in the following Examples 
were as follows: 

Enzymes: DNAse I and alkaline phosphatase were 
obtained from Boehringer Mannheim Biochemicals (BMB, 
15 Indianapolis, IN); EcoRI , EcoRI methylase, DNA ligase, 
and DNA Polymerase I, from New England Biolabs (NEB, 
Beverly MA); and RNase A was obtained from Sigma (St. 
Louis, MO) . 

Other reagents: EcoRI linkers were obtained 
20 from NEB; and nitro blue tetrazolium (NBT), 
5-bromo-4-chloro-3-indolyl phosphate (BGIP) 

5-bromo-4-chloro-3-indolyl-B-D-galactopyranoside (X- 
gal) and isopropyl B-D-thiogalactopyranoside (IPTG) 
were obtained from Sigma. 

25 cDN A synthesis kit and random priming labeling 

kits are available from Boehringer-Mannheim Biochemical 
(BMB, Indianapolis, IN) . 

Eample 1 

30 Preparing cDNA Library 

A. Source of ET-NANB virus 

Two cynomolgus monkeys (cynos) were 
intravenously injected with a 10% suspension of a stool 
35 pool obtained from a second-passage cyno (cyno #37) 

infected with a strain of ET-NANB virus isolated from 
Burma cases whose stools were positive for ET-NANB, as 
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evidenced by binding of 27-34 nm virus-like particles 
{VLPs) in the stool to immune serum from a known ET- 
NANB patient. The animals developed elevated levels, of 
alanine aminotransferase (ALT) between 24-36 days after 
innoculation, and one excreted 27-34 nm VLPs in its 
bile in the pre-acute phase of infection. 

The bile duct of each infected animal was 
cannulated and about 1-3 cc of bile was collected 
daily, rna was extracted from one bile specimen (cyno 
#121) by hot phenol extraction, using a standard RNA 
isolation procedure. Double-strand cDNA was formed 
from the isolated RNA by a random primer for first- 
strand generation, using a cDNA synthesis kit obtained 
from Boehringer-Mannheim (Indianapolis, IN). 



B. Cloning the Duplex Fragments 

The duplex cDNA fragments were blunt-ended with 
T4 DNA polymerase under standard conditions {Maniatis, 
p. 118), then extracted with phenol/chloroform and 
precipitated with ethanol. The blunt-ended material 
was ligated with EcoRI linkers under standard 
conditions (Maniatis, pp. 396-397) and digested with 
EcoRI to remove redundant linker ends. Non-ligated 
linkers were removed by sequential isopropanol 
25 precipitation. 

Lambda gtlO phage vector (Huynh) was obtained 
from Promega Biotec (Madison, WI). This cloning vector 
has a unique EcoRI cloning site in the phage cl 
repressor gene. The cDNA fragments from above were 
introduced into the EcoRI site by mixing 0.5 - l.o ug 
EcoRI-cleaved gtlO, 0.5-3 u l of the above duplex 
fragments, 0.5 v l 10X ligation buffer, 0.5 pi ligase 
(200 units), and distilled water to 5 ul. The mixture 
was incubated overnight at 14«C, followed by in vitro 
packaging, according to standard methods (Maniatis, pp. 
256-268). 
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The packaged phage were used to infect an E. 
coli hfl strain, such as strain HG415. Alternatively, 
E- coli , strain C600 hfl r avialable from Promega 
Biotec, Madison, WI, could be used. The percentage of 
5 recombinant plaques obtained with insertion of the 
EcoRI-ended fragments was less than 5% by analysis of 
20 random plaques* 

The resultant cDNA library was plated and phage 
were eluted from the selection plates by addition of 

10 elution buffer. After DNA extraction from the phage, 
the DNA was digested with EcoRI to release the 
heterogeneous insert population, and the DNA fragments 
were fractionated on agarose to remove phage 
fragments. The 500-4,000 basepair inserts were 

15 isolated and recloned into lambda gtlO as above, and 
the packaged phage was used to infect B. coli strain 
HG415. The percentage of successful recombinants was 
greater than 95%. The phage library was plated on 
coli strain HG415, at about 5,000 plaques/plate, on a 

20 total of 8 plates. 



25 



30 
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Example 2 
Selecting ET-NANB Cloned Fragments 
A. cDNA Probes 

Duplex cDNA fragments from noninfected and ET- 
5 NANB-infected cynomolgus monkeys were prepared as in 
Example 1. The cDNA fragments were radiolabeled by 
random priming, using a random-priming labeling kit 
obtained from Boehringer-Mannheim (Indianapolis/ IN). 

10 B. Clone Selection 

The plated cDNA library from Example 1 was 
transferred to each of two nitrocellulose filters, and 
the phage DNA was fixed on the filters by baking, 
according to standard methods (Maniatis, pp. 320- 

15 323). The duplicate filters were hybridized with 
either, infected-source or control cDNA probes from 
above. Autoradiographs of the filters were examined to 
identify library clones which hybridized with 
radiolabeled cDNA probes from infected source only, 

20 i.e., did not hybridize with cDNA probes from the non- 
infected source. Sixteen such clones, out of a total 
of about 40,000 clones examined, were identified by 
this subtraction selection method. 

Each of the sixteen clones was picked and 

25 replated at low concentration on an agar plate. The 
clones on each plate were transferred to two nitro- 
cellulose as duplicate lifts, and examined for hybrid- 
ization to radiolabeled cDNA probes from infected and 
noninfected sources, as above. Clones were selected 

30 which showed selective binding for infected-source 

probes (i.e., binding with infected-source probes and 
substantially no binding with non-infected-source 
probes). One of the clones which bound selectively to 
probe from infected source was isolated for further 

35 study. The selected vector was identified as lambda 
gtlO-1.1, indicated in Figure 1. 
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Example 3 
ET-NANB Sequence 
Clone lambda gtlO-1.1 from Example 2 was 
digested with EcoRI to release the heterologous insert, 
5 which was separated from the vector fragments by gel 
electrophoresis. The electrophoretic mobility of the 
fragment was consistent with a 1.33 kb fragment. This 
fragment, which contained Eco RI ends, was inserted into 
the Eco RI site of a pTZ-KFl vector, whose construction 

10 and properties are described in co-owned U.S. patent 
application for "Cloning Vector System and Method for 
Rare Clone Identif ication", Serial No. 125, 650, filed 
November 25, 1987. Briefly, and as illustrated in 
Figure 1, this plasmid contains a unique EcoR I site 

15 adjacent a T7 polymerase promoter site, and plasmid and 
phage origins of replication. The sequence immediately 
adjacent each side of the EcoRI site is known. E. coli 
BB4 bacteria, obtained from Stratagene (La Jolla, CA, 
were transformed with the plasmid. 

20 Radiolabeled ET-NANB probe was prepared by 

excising the 1.33 kb insert from the lambda gtlO-1.1 
phage in Example 2, separating the fragment by gel 
electrophoresis, and randomly labeling as above. 
Bacteria transfected with the above pTZ-KFl and 

25 containing the desired ET-NANB insert were selected by 
replica lift and hybridization with the radiolabeled 
ET-NANB probe, according to methods outlined in Example 
2. 

30 One bacterial colony containing a successsful 

recombinant was used for sequencing a portion of the 
1.33 kb insert. This isolate, designated pTZ- 
KFl(ETl.l), has been deposited with the American Type 
* Culture Collection, and is identified by ATCC deposit 

35 no. 67717. Using a standard dideoxy sequencing 

procedure, and primers for the sequences flanking the 
EcoRI site, about 200-250 basepairs of sequence from 
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the 5 '-end region and 3 l -end region of the insert were 
obtained* The sequences are given above in Section 
II. Later sequencing by the same techniques gave the 
full sequence in both directions, also given above." 

Example 4 
Detecting ET-NANB Sequences 
cDNA fragment mixtures from the bile of 
noninfected and ET-NANB-infected cynomolgus monkeys 
were prepared as above. The cDNA fragments obtained 
from human stool samples were prepared as follows. 
Thirty ml of a 10% stool suspension obtained from an 
individual from Mexico diagnosed as infected with ET- 
NANB as a result of an ET-NANB outbreak, and a similar 
15 volume of stool from a healthy, non-infected 

individual, were layered over a 30% sucrose density 
gradient cushion, and centrifuged at 25,000 xg for 6 hr 
in an SW27 rotor, at.l5°C. The pelleted material from 
the infected-source stool contained 27-34 run VLP 
20 particles characteristic of ET-NANB infection in the 
infected-stool sample. RHA was isolated from the 
sucrose-gradient pellets in both the infected and non- 
infected samples, and the isolated RNA was used to 
produce cDNA fragments as described in Example 1. 

25 

The cDNA fragment mixtures from infected and 
non-infected bile source, and from infected and non- 
infected human-stool source were each amplified by a 
novel linker/primer replication method described in co- 

30 owned patent application serial number 07/208,512 for 
"DNA Amplification and Subtraction Technique," filed 
June 17, 1988. Briefly, the fragments in each sample 
were blunt-ended with DNA Pol I then extracted with 
phenol/chloroform and precipitated with ethanol. The 

35 blunt-ended material was ligated with linkers having 
the following sequence: 
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5 ' -GGAATTCGCGGCCGCTCG-3 1 
3 1 -TTCCTTAAGCGCCGGCGAGC- 5 1 

The duplex fragements were digested with Nrul 
5 to remove linker dimers, mixed with a primer having the 
sequence 5 1 -GGAATTCGCGGCCGCTCG-3 1 , and then heat 
denatured and cooled to room temperature to form 
single-strand DNA/primer complexes. The complexes were 
replicated to form duplex fragments by addition of 
10 Thermus aquaticus (Taq) polymerase and all four 
deoxynucleotides. The replication jprocedures, 
involving successive strand denaturation f formation of 
strand/primer complexes, and replication, was repeated 
25 times. 

15 The amplified cDNA sequences were fractionated 

by agarose gel electrophoresis, using a 2% agarose 
matrix. After transfer of the DNA fragments from the 
agarose gels to nitrocellulose paper, the filters were 
hybridized to a random-labeled 32 P probe prepared by 

20 (i) treating the pTZ-KFl(ETl.l) plasmid from above with 
EcoR I , (ii) isolating the released 1.33 kb ET-NANB 
fragment, and (iii) randomly labeling the isolated 
fragment. The probe hybridization was performed by 
conventional Southern blotting methods (Maniatis, pp. 

25 382-389). Figure 2 shows the hybridization pattern 

obtained with cDNAs from infected (I) and non-infected 
(N) bile sources (2A) and from infected (I) and non- 
infected (N) human stool sources (2B). As seen, the 
ET-NANB probe hybridized with fragments obtained from 

30 both of the infected sources, but was non-homologous to 
sequences obtained from either of the non-infected 
sources, thus confirming the specificity of derived 
sequence • 

Southern blots of the radiolabeled 1.33 kb 
35 fragment with genomic DNA fragments from both human and 
cynomologus-monkey DNA were also prepared. No probe 
hybridization to either of the genomic fragment 
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mixtures was observed, confirming that the ET-NANB 
sequence is exogenous to either human or cynomolgus 
genome. 

Example 5 
Expressing ET-NANB Proteins 

ft. Preparing ET-NANB Coding Sequences 

The pTZ-KFl(ETl.l) plasmid from Example 2 was 
digested with EcoRI to release the 1.33 kb ET-NANB 
insert which was purified from the linearized plasmid 
by gel electrophoresis. The purified fragment was 
suspended in a standard digest buffer (0.5M Tris HC1, 
pH 7.5; 1 mg/ml BSA; lOmM MnCl 2 ) to a concentration of 
15 about 1 mg/ml and digested with DNAse I at room 
temperature for about 5 minutes. These reaction 
conditions were determined from a prior calibration 
study, in which the incubation time required to produce 
predominantly 100-300 basepair fragments was 
determined. The material was extracted with 
phenol/chloroform before ethanol precipitation. 

The fragments in the digest mixture were blunt- 
ended and ligated with EcoRI linkers as in Example 1. 
The resultant fragments were analyzed by 
25 electrophoresis (5-10V/cm) on 1.2% agarose gel, using 
PhiX174/HaeIIl and lambda/Hindm size markers. The 
100-300 bp fraction was eluted onto NA45 strips 
(Schleicher and Schuell), which were then placed into 
1.5 ml microtubes with eluting solution (1 M NaCl, 50 
mM arginine, pff 9.0), and incubated at 67°C for 30-60 
minutes. The eluted DNA was phenol/chloroform 
extracted and then precipitated with two volumes of 
ethanol. The pellet was resuspended in 20 w l TE (0.01 
M Tris HC1, pH 7.5, 0.001 M EDTA) . 

35 
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B. Cloning in an Expression Vector 

Lambda gtll phage vector (Huynh) was obtained 
from Promega Biotec (Madison, WI). This cloning vector 
has a unique EcoRI cloning site 53 base pairs upstream 
from the beta-galactosidase translation termination 
codon. The genomic fragments from above were 
introduced into the EcoRI site by mixing 0.5-1.0 pg 
EcoRI-cleaved gtll, 0.3-3 ul of the above sized 
fragments, 0.5 v l 10X ligation buffer (above), 0.5 w l 
ligase (200 units), and distilled water to 5 w i. The 
mixture was incubated overnight at 14°C, followed by in 
vitro packaging, according to standard methods 
(Maniatis, pp. 256-268). 

The packaged phage were used to infect e. coli 
15 strain KM392, obtained from Dr. Kevin Moore, DNAX (Palo 
Alto, CA). Alternatively, E. Coli strain Y1090, 
available from the American Type Culture Collection 
(ATCC #37197), could be used. The infected bacteria 
were plated and the resultant colonies were checked for 
loss of beta-galactosidase activity-(clear plaques) in 
the presence of X-gal using a standard X-gal substrate 
plaque assay method (Maniatis). About 50% of the phage 
plaques showed loss of beta-galactosidase enzyme 
activity (recombinants). 

25 

C. Screening for ET-NANB Recombinant Proteins 

ET-NANB convalescent .antiserum was obtained 
from patients infected during documented ET-NANB 
outbreaks in Mexico, Borneo, Pakistan, Somalia, and 
Burma. The sera were immunoreactive with VLPs in stool 
specimens from each of several other patients with ET- 
NANB hepatitis. 

A lawn of E. coli KM392 cells infected with 
about 10 4 pfu of the phage stock from above was 
35 prepared on a 150 mm plate and incubated, inverted, for 
5-8 hours at 37°C. The lawn was overlaid with a 
nitrocellulose sheet, causing transfer of expressed ET- 
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NANB recombinant protein from the plaques to the 
paper. The plate and filter were indexed for matching 
corresponding plate and filter positions. 

The filter was washed twice in TBST buffer (10 
5 mM Tris, pH 8.0, 105 raM NaCl, 0.05% Tween 20) , blocked 
with AIB (TBST buffer with 1% gelatin), washed again in 
TBST, and incubated overnight after addition of 
antiserum (diluted to 1:50 in AIB, 12-15 ml/plate). 
The sheet was washed twice in TBST and then contacted 

10 with enzyme-labeled anti-human antibody to attach the 
labeled antibody at filter sites containing antigen 
recognized by the antiserum. After a final washing, 
the filter was developed in a substrate medium 
containing 33 ul NBT (50 mg/ml stock solution 

15 maintained at 5°C) mixed with 16 pi BCIP (50 mg/ml 

stock solution maintained at 5°C) in 5 ml of alkaline 
phosphatase buffer (100 mM Tris, 9.5, 100 mM NaCl, 5 mM 
MgCl 2 ). Purple color appeared at points of antigen 
production, as recognized by the antiserum. 

20 

D. Screening Plating 

The areas of antigen production determined in 
the previous step were replated at about 100-200 pfu on 
an 82 mm plate. The above steps, beginning with a 5-8 
25 hour incubation, through NBT-BCIP development, were 
repeated in order to plaque purify phage secreting an 
antigen capable of reacting with the ET-NANB 
antibody. The identified plaques were picked and 
eluted in phage buffer (Maniatis, p. 443). 

30 

E. Epitope Identification 

A series of subclones derived from the original 
pTZ-KPl (ET1.1) plasinid from Example 2 were isolated 
using the same techniques described above. Each of 
35 these five subclones were immunoreactive with a pool of 
anti-ET antisera noted in c. The subclones contained 
short sequences from the "reverse" sequence set forth 
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previously. The beginning and ending points of the 
sequences in the subclones (relative to the full 
"reverse" sequence), are identified in the table 
below. 

5 



TABLE 1 

Subclone Position in "Reverse" Sequence 



10 5 " end 3* -end 

Yl 522 643 

Y2 594 667 

Y3 508 665 

Y4 558 752 

15 Y5 545 665 



Since all of the gene sequences identified in 
the table must contain the coding sequence for the 

20 epitope, it is apparent that the coding sequence for 
the epitope falls in the region between nucleotide 594 
(5' -end) and 643 (3'-end). Genetic sequences 
equivalent to and complementary to this relatively 
short sequence are therefore particularly preferred 

25 aspects of the present invention, as are peptides 
produced using this coding region, 

A second series of clones identifying an 
altogether different epitope was isolated with only 
Mexican serum. 

30 
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TABLE 2 

Subclone Position in "Forward" Sequence 

5'end 3* end 

ET 2-2 2 193 

ET 8-3 2 13 5 

ET 9-1 2 109 

ET 13-1 2 X01 



10 



The coding system for this epitope falls 
between nucleotide 2 (5'-end) and 101 (3'-end). 
Genetic sequences related to this short sequence are 
15 therefore also preferred, as are peptides produced 
using this coding region. 

While the invention has been described with 
reference to particular embodiments, methods, 
construction and use, it will be apparent to those 
20 skilled in the art that various changes and 

modifications can be made without departing from the 
invention. 
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WE CLAIM: 



1. A protein derived from an enter ically transmitted 
non-A/non-B viral hepatitis agent whose genome 
5 contains a region which is homologous to the 1.33 

kb DNA Eco RI insert present in plasmid pTZ- 
KFl(ETl.l) carried in E. coli strain BB4 and having 
ATCC deposit no. 67717. 

10 2. The protein of claim 1, which is encoded by a 
coding region within said 1.33 kb Eco RI insert. 

3. A recombinant protein derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose 
15 genome contains a region which is homologous to a 

duplex DNA having a first sequence: 



1 11 21 31 41 51 

****** 

2 0 CGAATTCCGCCAACTGATGGAAGGCACTAATCTGGCAAGACCTGTCCCTGTTGCAGCTGT 

61 71 81 91 101 ill 

****** 

TCTACCACCCTGCCCCGAGCTCGAACAGGGCCTTCTCTACCTGCCCCAGGAGCTCACCAC 

25 

121 131 141 151 161 171 

* * . * * * * 

CTGTGATAGTGTCGTAACATTTGAATTAACAGACATTGTGCACTGCCGCATGGCCGCCCC 
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iSl 191 201 211 221 



231 



GAGCCAGCGCAAGGCCGTGCTGTCCACACTCGTGGGCCGCTACGGCGTCGCACAAAGCTC 

5 241 251 261 271 281 291 

****** 

TACAATGCTTCCCACTCTGATGTTCGCGACTCTCTCGCCCGTTTTATCCCGGCCATTGGC 

301 311 321 331 341 351 

10 * * * * * # 

CCCGTACAGGTTACAACTTGTGAATTGTACGAGCTAGTGGAGGCCATGGTCGAGAAGGGC 

361 371 381 391 401 411 

* * * * * * 

15 CAGGATGGCTCCGCCGTCCTTGAGCTTGATCTTTGCAACCGTGACGTGTCCAGGATCACC 

421 431 441 451 461 471 

* * * * * * 

TTCTTCCAGAAAGATTGTAACAAGTTCACCACAGGTGAGACCATTGCCCATGGTAAAGTG 

481 4 91 501 511 521 531 

* * * * * * 

GGCCAGGGCATCTCGGCCTGGAGCAAGACCTTCTGCGCCCTCTTTGGCCCTTGGTTCCGC 

25 541 551 561 571 581 591 

****** 

GCTATTGAGAAGGCTATTCTGGCCCTGCTCCCTCAGGGTGTGTTTTACGGTGATGCCTTT 

601 611 621 631 641 651 

30 * * * * * * 

GATGACACCGTCTTCTCGGCGGCTGTGGCCGCAGCAAAGGCATCCATGGTGTTTGAGAAT 



20 



661 671 681 691 

***** 

3 5 GACTTTTCTGAGTTTGACTCCACCCAGAATAACTTTTCTCTGGGTCTAGAGTG 



701 711 

* 

TGCTATT 



WO 89/1 2462 PCT/US89/02648 

57 

721 731 741 751 761 771 

* * * * * . * 

ATGGAGGAGTGTGGGATGCCGCAGTGGCTCATCCGCCTGTATCACCTTATAAGGTCTGCG 

5 781 791 801 811 821 831 

* * * * * * 

TGGATCTTGCAGGCCCCGAAGGAGTCTCTGCGAGGGTTTTGGAAGAAACACTCCGGTGAG 

841 851 861 871 881 891 

10 * * * * * * 

CCCGGCACTCTTCTATGGAATACTGTCTGGAATATGGCCGTTATTACCCACTGTTATGAC 

901 911 921 931 941 951 

* * * * * * 

15 TTCCGCGATTTTCAGGTGGCTGCCTTTAAAGGTGATGATTCGATAGTGCTTTGCAGTGAG 

961 971 981 991 1001 1011 

* * * * * * 

TATCGTCAGAGTCCAGGAGCTGCTGTCCTGATCGCCGGCTGTGGCTTGAAGTTGAAGGTA 

20 

1021 1031 1041 1051 1061 1071 

* ***** 

GATTTCCGCCCGATCGGTTTGTATGCAGGTGTTGTGGTGGCCCCCGGCCTTGGCGCGCTC 

25 1081 1091 1101 1111 1121 1131 

* * * * * * 

CCTGATGTTGTGCGCTTCGCCGGCCGGCTTACCGAGAAGAATTGGGGCCCTGGCCCTGAG 

1141 1151 1161 1171 1181 1191 

30 * * * * * * 

CGGGCGGAGCAGCTCCGCCTCGCTGTTAGTGATTTCCTCCGCAAGCTCACGAATGTAGCT 

1201 1211 1221 1231 1241 1251 

* * * * * * 

3 5 CAGATGTGTGTGGATGTTGTTTCCCGTGTTTATGGGGTTTCCCCTGGACTCGTTCATAAC 
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1261 1271 1281 1291 1301 1311 

* * * * * * 

CTGATTGGCATGCTACAGGCTGTTGCTGATGGCAAGGCACATTTCACTGAGTCAGTAAAA 

1321 1331 1341 



CCAGTGCTCGACCGGAATTCGAGC , 



a second sequence 



1 11 . 21 31 41 51 

* * * * * * 

GCTCGAATTCCGGTCGAGCACTGGTTTTACTGACTCAGTGAAATGTGCCTTGCCATCAGC 

15 61 71 81 91 id X11 

* * * * * * 

AACAGCCTGTAGCATGCCAATCAGGTTATGAACGAGTCCAGGGGAAACCCCATAAACACG 

121 "1 141 151 161 171 

20 * * * * * * 

GGAAACAACATCCACACACATCTGAGCTACATTCGXGAG.CTTGCGGAGGAAATCACTAAC 

181 191 201 211 221 231 

* * * * * * 

AGCGAGGCGGAGCTGCTCCGCCCGCTCAGGGCCAGGGCCCCAATTCTTCTCGGTAAGCCG 

.241 251 261 271 281 291 

****** 

GCCa3C(^GCGO^(^TCAGGGAGCGCGCC^GGCCGGGGGCCACCACAACACCTGC 

301 311 321 331 341 351 

****** 



ATACAAACCGATCGGGCGGAAATCTACCTTCAACTTCAAGCCACAGCCGGCGATCAGG 



AC 



35 361 371 381 391 401 411 

****** 

AGCAGCTCCTGGACTCTGACGATACTCACTGCAAAGCACTATCGAATCATCACCTTTAAA 
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421 431 441 451 461 471 

* * * * * * 

GGCAGCCACCTGAAAATCGCGGAAGTCATAACAGTGGGTAATAACGGCCATATTCCAGAC 

5 

481 491 501 511 521 531 

****** 

AGTATTCCATAGAAGAGTGCCGGGCTCACCGGAGTGTTTCTTCCAAAACCCTCGCAGAGA 

10 541 551 561 571 581 591 

****** 

CTCCTTCGGGGCCTGCAAGATCCACGCAGACCTTATAAGGTGATACAGGCGGATGAGCCA 

601 611 621 631 641 651 

15 * * * * * * 

CTGCGGCATCCCACACTCCTCCATAATAGCACACTCTAGACCCAGAGAAAAGTTATTCTG 

661 671 681 691 701 711 

* * * * * * 

2 0 GGTGGAGTCAAACTCAGAAAAGTCATTCTCAAACACCATGGATGCCTTTGCTGCGGCCAC 

721 731 741 751 761 771 

****** 

AGCCGCCGAGAAGACGGTGTCATCAAAGGCATCACCGTAAAACACACCCTGAGGGAGCAG 

25 

781 791 801 811 821 831 

♦ ****** 

GGCCAGAATAGCCTTCTCAATAGCGCGGAACCAAGGGCCAAAGAGGGCGCAGAAGGTCTT 

30 841 851 861 871 881 891 

* * * * * * 

GCTCCAGGCCGAGATGCCCTGGCCCACTTTACCATGGGCAATGGTCTCACCTGTGGTGAA 

901 911 921 931 941 951 

35 * * * * * * 

CTTGTTACAATCTTTCTGGAAGAAGGTGATCCTGGACACGTCACGGTTGCAAAGATCAAG 
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961 971 981 991 1001 igil 

* * * * * * 

CTCAAGGACGGCGGAGCCATCCTGGCCCTTCTCGACCATGGCCTCCACTAGCTCGTACAA 

5 1021 1031 1041 1051 1061 1071 

* 

TTCACAAGTTGTAACCTGTACGGGGCCAATGGCCGGGATAAAACGGGCGAGAGAGTCGCG 

1081 1091 1101 1111 1121 1131 

10 * * * * * 4 

AACATCAGAGTGGGAAGCATTGTAGAGCTTTGTGCGACGCCGTAGCGGCCCACGAGTGTG 
1141 1151 1161 1171 Hal H9i 



15 



20 



30 



35 



* * * * * 



* 



GACAGCACGGCCTTGCGCTGGCTCGGGGCGGCCATGCGGCAGTGCACAATGTCTGTTAAT 

1201 1211 1221 1231 1241 1251 

* * * * * * 

TCAAATGTTACGACACTATCACAGGTGGIGAGCTCCTGGGGCAGGTAGAGAAGGCCCTGT 
1261 1271 1281 1291 



1301 1311 

* 

TCGAGCTCGGGGCAGGGTGGTAGAACAGCTGCAACAGGGACAGGTCTTGCCAGATTAGTG 



* ■ * * * * 



25 1321 1331 1341 

* * * 

CCTTCCATCAGTTGGCGGAATTCG , 



a third sequence 

1 CGGTTGTTCA GTACCAGTTT ACTGCAGGTG TGCCTGGATC CGGCAAGTCC 
51 CGCTCTATCA CCCAAGCCGA TGTGGACGTT GTCGTGGTCC CGACGCGTGA 
101 GTTGCGTAAT GCCTGGCGCC GTCGCGGCTT TGCTGCTTTT ACCCCGCATA 
151 CTGCCGCCAG AGTCACCCAG GGGCGCCGGG TTGTCATTGA TGAGGCTCCA 
201 TCCCTCCCCC CTCACCTGCT GCTGCTCCAC ATGCAGCGGG CCGCCACCGT 
251 CCACCTTCTT GGCGCCCCGA ACCAGATCCC AGCCATCGAC TTTGAGCACG 
301 CTGGGCTCGT CCCCGCCATC AGGCCCGACT TAGCCCCACC TCCTGGTGGC 
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351 ATGTTACCCA TCGCTGCCCT 
401 TACCCCATGA TCCAGACCAC 
451 TGAGCCTGCC GTCGGGCAGA 
501 CCAACCCCGG CTCAGTGACG 
5 551 GAGACCACTA TTATTGCCAC 

601 TCGGGCTCAT GCCATTGTTG 
651 TCATTGACGC ACCAGGCCTG 
701 GTTAATAACT TTTTCCTCGC 
751 AGTTATTCCC CGTGGCAACC 
10 801 TCCCGCCGTC TTGCCAGATT 

851 GGCCACAGAC CTGTCCCTGT 
901 CGAACAGGGC CTTCTCTACC 
951 TCGTAACATT TGAATTAACA 
1001 AGCCAGCGCA AGGCCGTGCT 
15 1051 ACAAAGCTCT ACAATGCTTC 

1101 TTTTATCCCG GCCATTGGCC 
1151 AGCTAGTGGA GGCCATGGTC 
1201 GAGCTTGATC TTTGCAACCG 
1251 AGATTGTAAC AAGTTCACCA 
20 1301 GCCAGGGCAT CTCGGCCTGG 
1351 TGGTTCCGCG CTATTGAGAA 
1401 GTTTTACGGT GATGCCTTTG 
1451 CAGCAAAGGC ATCCATGGTG 
1501 ACCCAGAATA ACTTTTCTCT 
25 1551 TGGGATGCCG CAGTGGCTCA 

1601 GGATCTTGCA GGCCCCGAAG 
1651 TCCGGTGAGC CCGGCACTCT 
1701 TATTACCCAC TGTTATGACT 
1751 GTGATGATTC GATAGTGCTT 
30 1801 GCTGTCCTGA TCGCCGGCTG 

1851 GATCGGTTTG TATGCAGGTG 
1901 CTGATGTTGT GCGCTTCGCC 
1951 GGCCCTGAGC GGGCGGAGCA 
2001 CAAGCTCACG AATGTAGCTC 
35 2051 ATGGGGTTTC CCCTGGACTC 

2101 GTTGCTGATG GCAAGGCACA 
2151 CTTGACAAAT TCAATCTTGT 



GCGGATGTAT GCGAGCTCAT CCGTGGTGCA 
TAGCCGGGTT CTCCGTTCGT TGTTCTGGGG 
AACTAGTGTT CACCCAGGCG GCCAAGGCCG 
GTCCACGAGG CGCAGGGCGC TACCTACACG 
AGCAGATGCC CGGGGCCTTA TTCAGTCGTC 
CTCTGACGCG CCACACTGAG AAGTGCGTCA 
CTTCGCGAGG TGGGCATCTC CGATGCAATC 
TGGTGGCGAA ATTGGTCACC AGCGCCCATC 
CTGACGCCAA TGTTGACACC CTGGCTGCCT 
AGTGCCTTCC ATCAGTTGGC TGAGGAGCTT 
TGCAGCTGTT CTACCACCCT GCCCCGAGCT 
TGCCCCAGGA GCTCACCACC TGTGATAGTG 
GACATTGTGC ACTGCCGCAT GGCCGCCCCG 
GTCCACACTC GTGGGCCGCT ACGGCGTCGC 
CCACTCTGAT GTTCGCGACT CTCTCGCCCG 
CCGTACAGGT TACAACTTGT GAATTGTACG 
GAGAAGGGCC AGGATGGCTC CGCCGTCCTT 
TGACGTGTCC AGGATCACCT TCTTCCAGAA 
CAGGTGAGAC CATTGCCCAT GGTAAAGTGG 
AGCAAGACCT TCTGCGCCCT CTTTGGCCCT 
GGCTATTCTG GCCCTGCTCC CTCAGGGTGT 
ATGACACCGT CTTCTCGGCG GCTGTGGCCG 
TTTGAGAATG ACTTTTCTGA GTTTGACTCC 
GGGTCTAGAG TGTGCTATTA TGGAGGAGTG 
TCCGCCTGTA TCACCTTATA AGGTCTGCGT 
GAGTCTCTGC GAGGGTTTTG GAAGAAACAC 
TCTATGGAAT ACTGTCTGGA ATATGGCCGT 
TCCGCGATTT TCAGGTGGCT GCCTTTAAAG 
TGCAGTGAGT ATCGTCAGAG TCCAGGAGCT 
TGGCTTGAAG TTGAAGGTAG ATTTCCGCCC 
TTGTGGTGGC CCCCGGCCTT GGCGCGCTCC 
GGCCGGCTTA CCGAGAAGAA TTGGGGCCCT 
GCTCCGCCTC GCTGTTAGTG ATTTCCTCCG 
AGATGTGTGT GGATGTTGTT TCCCGTGTTT 
GTTCATAACC TGATTGGCAT GCTACAGGCT 
TTTCACTGAG TCAGTAAAAC CAGTGCTCGA 
GTCGGGTGGA ATGAATAACA TGTCTTTTGC 
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2201 


TGCGCCCATG 


GGTTCGCGAC 


CATGCGCCCT 


CGGCCTATTT 


TGTTGCTGCT 


2251 


CCTCATGTTT 


TTGCCTATGC 


TGCCCGCGCC 


ACCGCCCGGT 


CAGCCGTCTG 


2301 


GCCGCCGTCG 


TGGGCGGCGC 


AGCGGCGGTT 


CCGGCGGTGG 


TTTCTGGGGT 


2351 


GACCGGGTTG ATTCTCAGCC CTTCGCAATC 


CCCTATATTC 


ATCCAACCAA 


2401 


CCCCTTCGCC 


CCCGATGTCA 


CCGCTGCGGC 


CGGGGCTGGA 


CCTCGTGTTC 


2451 


GCCAACCCGC 


CCGACCACTC 


GCGTCCGCTT 


GGCGTGACCA 


GGCCCAGCGC 


2501 


CCCGCCGTTG 


CCTCACGTCG 


TAGACCTACC 


ACAGCTGGGG 


CCGCGCCGCT 


2551 


AACCGCGGTC 


GCTCCGGCCC 


CG f 







10 or a fourth sequence 

1 GCGGCCGCTC GGTGGGGTTC CACCTCTCTT CCCATTCCGA ACAGAGAAGT 
51 TAAGCAAGGA ATTAATTCGC GGCCGCTCGT GTTGCGTGAG GTGGGTATCT 
101 CAGATGCCAT TGTTAATAAT TTCTTCCTTT CGGGTGGCGA GGTTGGTCAC 
15 151 CAGAGACCAT CGGTCATTCC GCGAGCAACC CTGACCGCAA TGTTGACGTG 

201 CTTGCGGCGT TTCCACCTTC ATGCCAAATA AGCGCCTTCC ATCAGCTTGC 
250 TGAGGAGCTG GGCCACCGGC GGCGCCGGTG CTGTGCTACC TCCCTGCCCT 
301 GAGCTTGAGC AGGGCCTTCT CTATCTGCCA CAGGAGCTAG CCTCCTGTGA 
350 CAGTGTTGTG ACATTTGAGC TAACTGACAT TGTGCACTGC CGCATGGCGG 
20 401 CCCCTAGCCA AAGGAAAGCT GTTTTGTCCA CGCTGGTAGG CCGGTATGGC 

451 AGACGCACAA GGCTTTATGA TGCGGGTCAC ACCGATGTCC GCGCCTCCCT 
501 TGCGCGCTTT ATTCCCACTC TCGGGCGGGT TACTGCCACC ACCTGTGAAC 
551 TCTTTGAGCT TGTAGAGGCG ATGGTGGAGA AGGGCCAAGA CGGTTCAGCC 
601 GTCCTCGAGT TGGATTTGTG CAGCCGAGAT GTCTCCCGCA TAACCTTTTT 
25 651 CCAGAAGGAT TGTAACAAGT TCACGACCGG CGAGACAATT GCGCATGGCA 

701 AAGTCGGTCA GGGTATCTCC GCCTGGAGTA AGACCTTTTG TGCCCTGTTT 
751 GGCCCCTGGT TCCGTTGCGA TTGAGAAGGC TATTCTATCC CTTTTACCAC 
801 AAGCTGTGTT CTACGGGGAT GCTTATGACG ACTCAGTATT CTCTGCTGCC 
851 GTGGCTGGCG CCAGCCATGC CATGGTGTTT GAAAATGATT TTTCTGAGTT 
30 901 TGACTCGACT CAGAATAACT TTTCCCTAGG TCTTGAGTGC GCCATTATGG 

951 AAGAGTGTGG TATGCCCCAG TGGCTTGTCA GGTTGTACCA TGCCGTCCGG 
1001 TCGGCGTGGA TCCTGCAGGC CCCAAAAGAG TCTTTGAGAG GGTTCTGGAA 
1051 GAAGCATTCT GGTGAGCCGG GCACGTTGCT CTGGAATACG GTGTGGAACA 
1101 TGGCAATCAT TGCCCATTGC TATGAGTTCC GGGACCTCCA GGTTGCCGCC 
35 1151 TTCAAGGGCG ACGACTCGGT CGTCCTCTGT AGTGAATACC GCCAGAGCCC 

1201 AGGCGCCGGT TCGCTTATAG CAGGCTGTGG TTTGAAGCCG AAGGCTGACT 
1251 TCCGGCCGAT TGGGCTGTAT GCCGGGGTTG TCGTCGCCCC GGGGCTCGGG 
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1301 GCCCTACCCG ATGTCGTTCG ATTCGCCGGA CGGCTTTCGG AGAAGAACTG 
1351 GGGGCCTGAT CCGGAGCGGG CAGAGCAGCT CCGCCTCGCC GTGCAGGATT 
1401 TCCTCCGTAG GTTAACGAAT GTGGCCCAGA TTTGTGTTGA GGTGGTGTCT 
1451 AGAGTTTACG GGGTTTCCCC GGGTCTGGTT CATAACCTGA TAGGCATGCT 
1501 CCAGACTATT GGTGATGGTA AGGCGCATTT TACAGAGTCT GTTAAGCCTA 
1551 TACTTGACCT TACACACTCA ATTATGCACC GGTCTGAATG AATAACATGT 
1601 GGTTTCCTGC GCCCATGGGT TCGCCACCAT GCGCCCTAGG CCTCTTTTGC 
1651 CGAGCGGCCG C, 



10 



or a sequence complementary thereto. 



4. A protein which is (a) immunoreactive with 
antibodies present in individuals infected with * 
enter ically transmitted nonA/nonB and (b) derived 

15 from a viral hepatitis agent whose genome contains 

a region which is homologous to the 1.33 kb DNA 
EcoRI insert present in plasmid pTZ-KFl(ETl.l) 
carried in E. coli strain BB4, and having ATCC 
deposit no. 67717. 

20 

5. The protein of claim 4, which is encoded by a 
coding region within said 1.33 kb* EcoRI insert. 

6. A method of detecting infection by enterically 
25 transmitted nonA/nonB hepatitis viral agent in a 

test individual, comprising: 

providing a peptide antigen which is (a) 
immunoreactive with antibodies present in 
individuals infected with enterically transmitted 

30 nonA/nonB hepatitis and (b) derived from a viral 

hepatitis agent whose genome contains a region 
which is homlogous to the 1.33 kb DNA Eco RI insert 
present in plasmid pTZ-KFl(ETl.l) carried in E. 
coli strain BB4, and having ATCC deposit no. 67717, 

35 reacting serum from the test individual 

with such antigen, and 

examining the antigen for the presence of 
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20 



25 



30 



35 



bound antibody . 

7. The method of claim 6, wherein the serum antibody 
is an igM or IgG antibody, or a mixture of both r 
the antigen. provided is attached to a support, said 
reacting includes contacting such serum with the 
support and said examining includes reacting the 
support and bound serum antibody with a reporter- 
labeled anti-human antibody. 



8. A kit for ascertaining the presence of serum 
antibodies which are diagnostic of enterically 
transmitted nonA/nonB hepatitis infection, 
comprising 

15 a support with surface-bound recombinant 

peptide antigen which is (a) immunoreactive with 
antibodies present in individuals infected with 
enterically transmitted nonA/nonB viral hepatitis 
agent and (b) derived from a viral hepatitis agent 
whose genome contains a region which is homologous 
to the 1,33 kb DNA EcoRI insert present in plasmid 
pTZ-KFl(ETl.l) carried in E. coli strain BB4, and 
having ATCC deposit no. 67717, 

a reporter-labeled anti-human antibody. 



9. A DNA fragment derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose 
genome contains a region which is homologous to the 
1.33 kb DNA EcoRI insert present in plasmid pTZ- 
KFl(ETl.l) carried in E. coli strain BB4 and having 
ATCC deposit no. 67717. 

10. The fragment of claim 9, which is derived from said 
1.33 kb EcoR I insert. 

11. A DNA fragment derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose 
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genome contains a region which is homologous to a 

duplex DNA fragment within a first sequence: 

1 11 21 31 41 51 

****** 

5 CGAATTCCGCCAACTGATGGAAGGCACTAATCTGGCAAGACCTGTCCCTGTTGCAGCTGT 

61 71 81 91 101 111 

****** 

TCTACCACCCTGCCCCGAGCTCGAACAGGGCCTTCTCTACCTGCCCCAGGAGCTCACCAC 

10 

121 131 141 151 161 171 

****** 

CTGTGATAGTGTCGTAACATTTGAATTAACAGACATTGTGCACTGCCGCATGGCCGCCCC 

15 181 191 201 211 221 231 

****** 

GAGCCAGCGCAAGGCCGTGCTGTCCACACTCGTGGGCCGCTACGGCGTCGCACAAAGCTC 

241 251 261 271 281 291 

20 * * * * * * 

TACAATGCTTCCCACTCTGATGTTCGCGACTCTCTCGCCCGTTTTATCCCGGCCATTGGC 

301 311 321 331 341 351 

25 * * * * * * 

CCCGTACAGGTTACAACTTGTGAATTGTACGAGCTAGTGGAGGCCATGGTCGAGAAGGGC 

361 371 381 391 401 411 

****** 

3 0 CAGGATGGCTCCGCCGTCCTTGAGCTTGATCTTTGCAACCGTG ACGTGTCCAGGATCACC 

421 431 441 451 461 471 

* * * * * * 

TTCTTCCAGAAAGATTGTAACAAGTTCACCACAGGTGAGACCATTGCCCATGGTAAAGTG 

35 

481 491 501 511 521 531 
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GGCCAGGGCATCTCGGCCTGGAGCAAGACCTTCTGCGCCCTCTTTGGCCCTTGGTTCCGC 

54 1 551 561 571 581 59! 

****** 

GCTATTGAGAAGGCTATTCTGGCCCTGCTCCCTCAGGGTGTGTTTTACGGTGATGCCTTT 

601 611 621 631 641 651 

* * * * * * 

GATGACACCGTCTTCTCGGCGGCTGTGGCCGCAGCAAAGGCATCCATGGTGTTTGAGAAT 



10 



661 671 681 691 • 701 711 

* * * * * * 

GACTTTTCTGAGTTTGACTCCACCCAGAATAACTTTTCTCTGGGTCTAGAGTGTGCT 



ATT 



15 721 731 741 751 761 771 

* * * * * * 

ATGGAGGAGTGTGGGATGCCGCAGTGGCTCATCCGCCTGTATCACCTTATAAGGTCTGCG 



781 * 791 801 811 821 



831 



20 * * * * * * 

TGGATCTTGCAGGCCCCGAAGGAGTCTCTGCGAGGGTTTTGGAAGAAACACTCCGGTGAG 

841 851 861 871 881 891 

* * * * * * 

CCCGGCACTCTTCTATGGAATACTGTCTGGAATATGGCCGTTATTACCCACTGTTATGAC 



25 



901 921 931 941 951 

* * * * * * 

TTCCGCGATTTTCAGGTGGCTGCCTTTAAAGGTGATGATTCGATAGTGCTTTGCAGTGAG 

30 

961 971 9 81 991 1001 1011 

* * * 

TATCGTCAGAGTCCAGGAGCTGCTGTCCTGATCGCCGGCTGTGGCTTGAAGTTGAAGGTA 

35 1021 1031 1041 1051 1061 1071- 

* * * * 

GAITTCCGCCCGATCGGTTTGTATGCAGGTGTTGTGGTGGCCCCCGGCCTTGGCGCGCTC 
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1081 1091 1101 Ull H2i U31 

* * * * * * 

CCTGATGTTGTGCGCTTCGCCGGCCGGCTTACCGAGAAGAATTGGGGCCCTGGCCCTGAG 

5 

1141 1151 1161 1171 H81 ii 91 

****** 

CGGGCGGAGCAGCTCCGCCTCGCTGTTAGTGATTTCCTCCGCAAGCTCACGAATGTAGCT 

10 1201 1211 1221 1231 1241 1251 

* 

CAGATGTGTGTGGATGTTGTTTCCCGTGTTTATGGGGTTTCCCCTGGACTCGTTCATAAC 

1261 1271 1281 1291 1301 13U 

15 * * * * * , 

CTGATTGGCATGCTACAGGCTGTTGCTGATGGCAAGGCACATTTCACTGAGTCAGTAAAA 

1321 1331 1341 

* * * 

20 CCAGTGCTCGACCGGAATTCGAGC, 

a second sequence 

1 11 21 31 41 51 

25 * * * * , ^ 

GCTCGAATTCCGGTCGAGCACTGGTTTTACTGACTCAGTGAAATGTGCCTTGCCATCAGC 

61 71 81 91 101 in 

****** 

30 AACAGCCTGTAGCATGCCAATCAGGTTATGAACGAGTCCAGGGGAAACCCCATAAACACG 

121 131 141 151 161 171 

* ■ * * * * * 

GGAAACAACATCCACACACATCTGAGCTACATTCGTGAGCTTGCGGAGGAAATCACTAAC 

35 

181 I" 201 211 221 231 
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AGCGAGGCGGAGCTGCTCCGCCCGCTCAGGGCCAGGGCCCCAATTCTTCTCGGTAAGCCG 

241 251 261 271 281 291 

* * * * * * 

GCCGGCGAAGCGCACAACATCAGGGAGCGCGCCAAGGCCGGGGGCCACCACAACACCTGC 

301 311 321 331 341 351 

* * * * * * 

ATACAAACCGATCGGGCGGAAATCTACCTTCAACTTCAAGCCACAGCCGGCGATCAGGAC 



10 



361 371 . 381 391 - 401 411 

* * * * * * 

AGCAGCTCCTGGACTCTGACGATACTCACTGCAAAGCACTATCGAATCATCACCTTTAAA 

15 421 431 441 451 461 471 

* * * * * * 

GGCAGCCACCTGAAAATCGCGGAAGTCATAACAGTGGGTAAfTAACGGCCATATTCCAGAC 

481 491 501 511 521 531 

20 * * * * * * 

AGTATTCCATAGAAGAGTGCCGGGCTCACCGGAGTGTTTCTTCCAAAACCCTCGCAGAGA 



25 



30 



* 



35 



* 
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541 551 561 571 581 591 

* ***** 

CTCCTTCGGGGCCTGCAAGATCCACGCAGACCTTATAAGGTGATACAGGCGGATGAGCCA 

5 601 611 621 631 641 651 

* ***** 

CTGCGGCATCCCACACTCCTCCATAATAGCACACTCTAGACCCAGAGAAAAGTTATTCTG 

661 671 681 691 701 711 

10 * * * * * * 

GGTGGAGTCAAACTCAGAAAAGTCATTCTCAAACACCATGGATGCCTTTGCTGCGGCCAC 
721 731 741 751 761 771 



15 



20 



AGCCGCCGAGAAGACGGTGTCATCAAAGGCATCACCGTAAAACACACCCTGAGGGAGCAG 

781 791 801 811 821 831 

* ***** 

GGCCAGAATAGCCTTCTCAATAGCGCGGAACCAAGGGCCAAAGAGGGCGCAGAAGGTCTT 



841 851 861 871 881 



891 



GCTCCAGGCCGAGATGCCCTGGCCCACTTTACCATGGGCAATGGTCTCACCTGTGGTGAA 

25 901 911 921 931 941 951 

****** 

CTTGTTACAATCTTTCTGGAAGAAGGTGATCCTGGACACGTCACGGTTGCAAAGATCAAG 

961 971 981 991 1001 1011 

30 * * * * * * 

CTCAAGGACGGCGGAGCCATCCTGGCCCTTCTCGACCATGGCCTCCACTAGCTCGTACAA 

1021 1031 1041 1051 1061 1071 

* ***** 

3 5 TTCACAAGTTGTAACCTGTACGGGGCCAATGGCCGGGATAAAACGGGCGAGAGAGTCGCG 
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1081 1091 1101 1111 H21 u 31 

* * * * * * 

AACATCAGAGTGGGAAGCATTGTAGAGCTTTGTGCGACGCCGTAGCGGCCCACGAGTGTG 

5 1141 1151 1161 . 1171 1181 1191 

* ***** 

GACAGCACGGCCTTGCGCTGGCTCGGGGCGGCCATGCGGCAGTGCACAATGTCTGTTAAT 

1201 1211 1221 1231 1241 1251 

.0 * * * * * * 

TCAAATGTTACGACACTATCACAGGTGGTGAGCTCCTGGGGCAGGTAGAGAAGGCCCTGT 

1261 1271 1281 1291 1301 13U 

* * . * * * * 
TCGAGCTCGGGGCAGGGTGGTAGAACAGCTGCAACAGGGACAGGTCTTGCCAGATTAGTG 

1321 1331 1341 

* * * 
CCTTCCATCAGTTGGCGGAATTCG f 

0 

a third sequence 



.5 



5 



0 



1 


CGGTTGTTCA 


GTACCAGTTT 


51 


CGCTCTATCA 


CCCAAGCCGA 


101 


GTTGCGTAAT 


GCCTGGCGCC 


151 


CTGCCGCCAG 


AGTCACCCAG 


201 


TCCCTCCCCC 


CTCACCTGCT 


251 


CCACCTTCTT 


GGCGCCCCGA 


301 


CTGGGCTCGT 


CCCCGCCATC 


351 


ATGTTACCCA 


TCGCTGCCCT 


401 


TACCCCATGA 


TCCAGACCAC 


451 


TGAGCCTGCC 


GTCGGGCAGA 


501 


CCAACCCCGG 


CTCAGTGACG 


551 


GAGACCACTA 


TTATTGCCAC 


601 


TCGGGCTCAT 


GCCATTGTTG 


651 


TCAJTGACGC 


ACCAGGCCTG 


701 


GTTAATAACT 


TTTTCCTCGC 



751 AGTTATTCCC CGTGGCAACC CTGACGCCAA TGTTGACACC CTGGCTGCCT 
801 TCCCGCCGTC TTGCCAGATT AGTGCCTTCC ATCAGTTGGC TGAGGAGCTT 
851 GGCCACAGAC CTGTCCCTGT TGCAGCTGTT CTACCACCCT GCCCCGAGCT 
901 CGAACAGGGC CTTCTCTACC TGCCCCAGGA GCTCACCACC TGTGATAGTG 
951 TCGTAACATT TGAATTAACA GACATTGTGC ACTGCCGCAT GGCCGCCCCG 
1001 AGCCAGCGCA AGGCCGTGCT GTCCACACTC GTGGGCCGCT ACGGCGTCGC 
1051 ACAAAGCTCT ACAATGCTTC CCACTCTGAT GTTCGCGACT CTCTCGCCCG 
1101 TTTTATCCCG GCCATTGGCC CCGTACAGGT TACAACTTGT GAATTGTACG 
1151 AGCTAGTGGA GGCCATGGTC GAGAAGGGCC AGGATGGCTC CGCCGTCCTT 
1201 GAGCTTGATC TTTGCAACCG TGACGTGTCC AGGATCACCT TCTTCCAGAA 
1251 AGATTGTAAC AAGTTCACCA CAGGTGAGAC CATTGCCCAT GGTAAAGTGG 
1301 GCCAGGGCAT CTCGGCCTGG AGCAAGACCT TCTGCGCCCT CTTTGGCCCT 
1351 TGGTTCCGCG CTATTGAGAA GGCTATTCTG GCCCTGCTCC CTCAGGGTGT 
1401 GTTTTACGGT GATGCCTTTG ATGACACCGT CTTCTCGGCG GCTGTGGCCG 
1451 CAGCAAAGGC ATCCATGGTG TTTGAGAATG ACTTTTCTGA GTTTGACTCC 
1501 ACCCAGAATA ACTTTTCTCT GGGTCTAGAG TGTGCTATTA TGGAGGAGTG 
1551 TGGGATGCCG CAGTGGCTCA TCCGCCTGTA TCACCTTATA AGGTCTGCGT 
1601 GGATCTTGCA GGGCCCGAAG GAGTCTCTGC GAGGGTTTTG GAAGAAACAC 
1651 TCCGGTGAGC CCGGCACTCT TCTATGGAAT ACTGTCTGGA ATATGGCCGT 
1701 TATTACCCAC TGTTATGACT TCCGCGATTT TCAGGTGGCT GCCTTTAAAG 
1751 GTGATGATTC GATAGTGCTT TGCAGTGAGT ATCGTCAGAG TCCAGGAGCT 
1801 GCTGTCCTGA TCGCCGGCTG TGGCTTGAAG TTGAAGGTAG ATTTCCGCCC 
1851 GATCGGTTTG TATGCAGGTG TTGTGGTGGC CCCCGGCCTT GGCGCGCTCC 
1901 CTGATGTTGT GCGCTTCGCC GGCCGGCTTA CCGAGAAGAA TTGGGGCCCT 
1951 GGCCCTGAGC GGGCGGAGCA GCTCCGCCTC GCTGTTAGTG ATTTCCTCCG 
2001 CAAGCTCACG AATGTAGCTC AGATGTGTGT GGATGTTGTT TCCCGTGTTT 
2051 ATGGGGTTTC CCCTGGACTC GTTCATAACC TGATTGGCAT GCTACAGGCT 
2101 GTTGCTGATG GCAAGGCACA TTTCACTGAG TCAGTAAAAC CAGTGCTCGA 
2151 CTTGACAAAT TCAATCTTGT GTCGGGTGGA ATGAATAACA TGTCTTTTGC 
2201 TGCGCCCATG GGTTCGCGAC CATGCGCCCT CGGCCTATTT TGTTGCTGCT 
2251 CCTCATGTTT TTGCCTATGC TGCCCGCGCC ACCGCCCGGT CAGCCGTCTG 
2301 GCCGCCGTCG TGGGCGGCGC AGCGGCGGTT CCGGCGGTGG TTTCTGGGGT 
2351 GACCGGGTTG ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA 
2401 CCCCTTCGCC CCCGATGTCA CCGCTGCGGC CGGGGCTGGA CCTCGTGTTC 
2451 GCCAACCCGC CCGACCACTC GCGTCCGCTT GGCGTGACCA GGCCCAGCGC 
2501 CCCGCCGTTG CCTCACGTCG TAGACCTACC ACAGCTGGGG CCGCGCCGCT 
2551 AACCGCGGTC GCTCCGGCCC CG, 
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or a fourth sequence 

1 GCGGCCGCTC GGTGGGGTTC CACCTCTCTT CCCATTCCGA ACAGAGAAGT 
51 TAAGCAAGGA ATTAATTCGC GGCCGCTCGT GTTGCGTGAG GTGGGTATCT 
101 CAGATGCCAT TGTTAATAAT TTCTTCCTTT CGGGTGGCGA GGTTGGTCAC 
151 CAGAGACCAT CGGTCATTCC GCGAGCAACC CTGACCGCAA TGTTGACGTG 
201 CTTGCGGCGT TTCCACCTTC ATGCCAAATA AGCGCCTTCC ATCAGCTTGC 
250 TGAGGAGCTG GGCCACCGGC GGCGCCGGTG CTGTGCTACC TCCCTGCCCT 
301 GAGCTTGAGC AGGGCCTTCT CTATCTGCCA CAGGAGCTAG CCTCCTGTGA 
350 CAGTGTTGTG ACATTTGAGC TAACTGACAT TGTGCACTGC CGCATGGCGG 
401 CCCCTAGCCA AAGGAAAGCT GTTTTGTCCA CGCTGGTAGG CCGGTATGGC 
451 AGACGCACAA GGCTTTATGA TGCGGGTCAC ACCGATGTCC GCGCCTCCCT 
501 TGCGCGCTTT ATTCCCACTC TCGGGCGGGT TACTGCCACC ACCTGTGAAC 
15 551 TCTTTGAGCT TGTAGAGGCG ATGGTGGAGA AGGGCCAAGA CGGTTCAGCC 

601 GTCCTCGAGT TGGATTTGTG CAGCCGAGAT GTCTCCCGCA TAACCTTTTT 
651 CCAGAAGGAT TGTAACAAGT TCACGACCGG CGAGACAATT GCGCATGGCA 
701 AAGTCGGTCA GGGTATCTCC GCCTGGAGTA AGACCTTTTG TGCCCTGTTT 
751 GGCCCCTGGT TCCGTTGCGA TTGAGAAGGC TATTCTATCC CTTTTACCAC 
801 AAGCTGTGTT CTACGGGGAT GCTTATGACG ACTCAGTATT CTCTGCTGCC 
851 GTGGCTGGCG CCAGCCATGC CATGGTGTTT GAAAATGATT TTTCTGAGTT 
901 TGACTCGACT CAGAATAACT TTTCCCTAGG TCTTGAGTGC GCCATTATGG 
951 AAGAGTGTGG TATGCCCCAG XGGCTTGTCA GGTTGTACCA TGCCGTCCGG 
1001 TCGGCGTGGA TCCTGCAGGC CCCAAAAGAG TCTTTGAGAG GGTTCTGGAA 
25 1051 GAAGCATTCT GGTGAGCCGG GCACGTTGCT CTGGAATACG GTGTGGAACA 

1101 TGGCAATCAT TGCCCATTGC TATGAGTTCC GGGACCTCCA GGTTGCCGCC 
1151 TTCAAGGGCG ACGACTCGGT CGTCCTCTGT AGTGAATACC GCCAGAGCCC 
1201 AGGCGCCGGT TCGCTTATAG CAGGCTGTGG TTTGAAGCCG AAGGCTGACT 
1251 TCCGGCCGAT TGGGCTGTAT GCCGGGGTTG TCGTCGCCCC GGGGCTCGGG 
30 1301 GCCCTACCCG ATGTCGTTCG ATTCGCCGGA CGGCTTTCGG AGAAGAACTG 

1351 GGGGCCTGAT CCGGAGCGGG CAGAGCAGCT CCGCCTCGCC GTGCAGGATT 
1401 TCCTCCGTAG GTTAACGAAT GTGGCCCAGA TTTGTGTTGA GGTGGTGTCT 
1451 AGAGTTTACG GGGTTTCCCC GGGTCTGGTT CATAACCTGA TAGGCATGCT 
1501 CCAGACTATT GGTGATGGTA AGGCGCATTT TACAGAGTCT GTTAAGCCTA 
35 1551 TACTTGACCT TACACACTCA ATTATGCACC GGTCTGAATG AATAACATGT 

1601 GGTTTCCTGC GCCCATGGGT TCGCCACCAT GCGCCCTAGG CCTCTTTTGC 
1651 CGAGCGGCCG C, 



20 
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or a sequence complementary thereto. 

12, The DNA fragment of claim 11, wherein said fragment 
5 contains a coding sequence homologous to nucleotides 2 

through 101 of said first sequence, nucleotides 594 
through 643 of said second sequence, or a sequence 
complementary to said coding sequences. 

10 13. A kit comprising, in a container or separate containers, 
a pair of single-strand primers derived from non- 
homologous regions of opposite strands of a DNA duplex 
fragment derived from an enterically transmitted viral 
hepatitis agent whose genome contains a region which is 

15 homologous to the 1.33 kb DNA EcoRI insert present in 

plasmid pTZ-KPl(ETl.l) carried in E. coli strain BB4 and 
having ATCC deposit no. 67717. 

14. The kit of claim 13, which are derived from opposite 
20 strands of the EcoRI duplex insert in said plasmid. 

15. A method for detecting the presence of an enterically 
transmitted nonA/nonB hepatitis viral agent in a 
biological sample, comprising 

25 preparing a mixture of duplex DNA 

fragments derived from the sample/ 
denaturing the duplex fragments, 
adding to the denatured DNA fragments, a 
pair of single-strand primers derived from non- 
30 homologous regions of opposite strands of a DNA duplex 
fragment derived from an enterically transmitted viral 
hepatitis agent whose genome contains a region which is 
homologous to the 1.33 kb DNA EcoRI 'insert present in 
plasmid pTZ-KPl(ETl.l) carried in E. coli strain BB4, 
35 and having ATCC deposit no. 67717/ 

hybridizing said primers to homologous- 
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sequence region of opposite strands of such duplex DNA 
fragments derived from enterically transmitted nonA/nonB 
hepatitis agent, 

reacting the primed fragment strands with DNA 
polymerase in the. presence of DNA nucleotides, to form 
new DNA duplexes containing the primer sequences, and 

repeating said denaturing, adding, hybridizing 
and reacting steps, until a desired degree of 
amplification of sequences is achieved. 

16. The method of claim 15, wherein the primers are derived 
from opposite strands the EcoRI duplex insert in said 
plasmid. 

15 17. The method of claim 15, for detecting the presence of 

viral agent in a sample of cultured cells infected with 
the agent. 



10 



20 



18. A vaccine for immunizing an individual against 

enterically transmitted nonA/nonB hepatitis viral agent 
comprising, in a pharmacologically acceptable adjuvant, 
a recombinant protein derived from an enterically 
transmitted nonA/nonB viral hepatitis agent whose genome 
contains a region which is homologous to the 1.33 kb DNA 
25 EcoRI insert present in plasmid pTZ-KFl(ETl.l) carried 

in B. coli strain BB4, and having ATCC deposit no. 
67717 . 

19 v The vaccine of claim 18, wherein the protein is derived 
30 from the EcoRI insert in said plasmid. 

20. in a method of isolating an enterically transmitted non- 
A/nonB viral agent or a nucleic acid fragment produced 
by the agent, an improvement which comprises: 
35 utilizing, as a source of said agent, bile obtained 

from a human or cynomolgus monkey having an active 
infection of enterically transmitted non-A/non-B 
hepatitis. 
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21. The method of claim 20, wherein the bile is obtained 
from an infected cynomologus monkey. 

5 22. Human polyclonal anti-serum obtained from a human 

immunized with a protein derived form an enterically 
transmitted non-A/non-B viral hepatitis agent whose 
genome contains a region which is homologous to the 1.33 
kb DNA EcoRl insert present in plasraid pTZ-KPl ( ET1 . 1 ) 

0 carried in E. coli strain BB4 and having ATCC deposit 

no. 67717. 



15 



20 



25 



30 



35 
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